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Series Preface 


The current rapid progress of research in such extremely important fields as dynamical 
systems, chaos and complexity, and nonlinear waves, combined with the steady 
development of state-of-the-art computer and graphical systems has opened new 
vistas for computational and mathematical physics. These vistas appear to be well 
beyond what was thought possible even a decade ago. In order to continue to 
stimulate and encourage these impressive developments, it is vital for applied 
mathematicians and physicists to communicate their achievements, results and ideas 
to scientists and engineers from a broad spectrum of backgrounds. The AIP Series 
in Computational and Mathematical Physics (CAMP) is intended to serve this 


purpose. 


The CAMP Series is comprised of graduate texts, monographs, and reference 
materials directed at researchers, teachers, and students in a range of pure and applied 
disciplines in order to achieve the above-stated goals. Books in the series address 
fields in computational physics such as numerical methods, novel computer hardware, 
nonlinear dynamics, and visualization. Topics in mathematical physics include 
statistical mechanics, quantum field theory, general relativity, or such topics in related 
mathematical disciplines as complexity theory, differential geometry, and group 
theory. Other interdisciplinary topics like artificial intelligence and neural networks 
are also included. Some volumes focus on narrow but engaging topics, while others 
provide a broad introduction or lay out the fundamentals of a field. 
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Preface 


Relations between mathematics and physics have a long and entangled tradition. 
In spite of repeated clashes resulting from the different aims and methods of the 
two disciplines, both sides have always benefitted. The place where contacts are 
most intensive is usually called mathematical physics, or if you prefer, physical 
mathematics. These terms express the fact that mathematical methods are needed 
here more to understand the essence of problems than as a computational tool, and 
conversely, the investigated properties of physical systems are also inspiring from 
the mathematical point of view. 

In fact, this field does not need any advocacy. When A. Wightman remarked 
a few years ago that it had become “socially acceptable”, it was a pleasant under- 
statement; by all accounts, mathematical physics is flourishing. It has long left the 
adolescent stage when it cherished only oscillating strings and membranes; nowadays 
it has built synapses to almost every part of physics. Evidence that the discipline is 
developing actively is provided by the fruitful oscillation between the investigation 
of particular systems and synthetizing generalizations, as well as by discoveries of 
new connections between different branches. 

The drawback of this rapid development is that it has become virtually impos- 
sible to write a textbook on mathematical physics as a single topic. There are, of 
course, books which cover a wide range of problems, some of them indeed monu- 
mental, but even they are like cities which govern the territory while watching the 
frontier slowly moving towards the gray distance. This is simply the price we have to 
pay for the flood of ideas, concepts, tools, and results that our science is producing. 

It was not our aim to write a poor man’s version of some of the big textbooks. 
What we want is to give students basic information about the field, by which we 
mean an amount of knowledge that could constitute the basis of an intensive one- 
year course for those who already have the necessary training in algebra and analysis, 
as well as in classical and quantum mechanics. If our exposition should kindle 
interest in the subject, the student will be able, after taking such a course, to read 
specialized monographs and research papers, and to discover a research topic to his 
or her taste. 

We have mentioned that the span of the contemporary mathematical physics is 
vast; nevertheless the cornerstone remains where it was laid by J. von Neumann, 
H. Weyl, and the other founding fathers, namely in regions connected with quantum 
theory. Apart from its importance for fundamental problems such as the constitution 
of matter, this claim is supported by the fact that quantum theory is gradually 
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becoming a basis for most branches of applied physics, and has in this way entered 
our everyday life. 

The mathematical backbone of quantum physics is provided by the theory of 
linear operators on Hilbert spaces, which we discuss in the first half of this book. 
Here we follow a well-trodden path; this is why references in this part aim mostly 
at standard book sources, even for the few problems which maybe go beyond the 
standard curriculum. To make the exposition self-contained without burdening 
the main text, we have collected the necessary information about measure theory, 
integration, and some algebraic notions in the appendices. 

The physical chapters in the second half are not intended to provide a self- 
contained exposition of quantum theory. As we have remarked, we suppose that the 
reader has background knowledge up to the level of a standard quantum mechanics 
course; the present text should rather provide new insights and help to reach a deeper 
understanding. However, we attempt to describe the mathematical foundations 
of quantum theory in a sufficiently complete way, so that a student coming from 
mathematics can start his or her way into this part of physics through our book. 

In connection with the intended purpose of the text, the character of referencing 
changes in the second part. Though the material discussed here is with a few 
exceptions again standard, we try in the notes to each chapter to explain extensions 
of the discussed results and their relations to other problems; occasionally we have 
set traps for the reader’s curiosity. The notes are accompanied by a selective but 
quite broad list of references, which map ways to the areas where real life dwells. 

Each chapter is accompanied by a list of problems. Solving at least some of 
them in full detail is the safest way for the reader to check that he or she has indeed 
mastered the topic. The problem level ranges from elementary exercises to fairly 
complicated proofs and computations. We have refrained from marking the more 
difficult ones with asterisks because such a classification is always subjective, and 
after all, in real life you also often do not know in advance whether it will take you 
an hour or half a year to deal with a given problem. 

Let us add a few words about the history of the book. It originates from courses 
of lectures we have given in different forms during the past two decades at Charles 
University and the Czech Technical University in Prague. In the seventies we pre- 
pared several volumes of lecture notes; ten years later we returned to them and 
rewrote the material into a textbook, again in Czech. It was prepared for publica- 
tion in 1989, but the economic turmoil which inevitably accompanied the welcome 
changes delayed its publication, so that it appeared only recently. 

In the meantime we suffered a heavy blow. Our friend and coauthor, Ji#f Blank, 
died in February 1990 at the age of fifty. His departure reminded us of the bitter 
truth that we usually are able to appreciate the real value of our relationships with 
fellow humans only after we have lost them. He was always a stabilizing element 
of our triumvirate of authors, and his spirit as a devoted and precise teacher is felt 
throughout this book; we hope that in this indirect way his classes will continue. 

Preparing the English edition was therefore left to the remaining two authors. 
It has been modified in many places. First of all, we have included two chapters and 
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some other material which was prepared for the Czech version but then left out due 
to editorial restrictions. Though the aim of the book is not to report on the present 
state of research, as we have already remarked, the original manuscript was finished 
four years ago and we felt it was necessary to update the text and references in 
some places. On the other hand, since the audience addressed by the English text is 
different — and is equipped with different libraries — we decided to rewrite certain 
parts from the first half of the book in a more condensed form. 

One consequence of these alterations was that we chose to do the translation 
ourselves. This decision contained an obvious danger. If you write in a language 
which you did not master during your childhood, the result will necessarily contain 
some unwanted comical twists reminiscent of the famous character of Leo Rosten. 
We are indebted to P. Moylan and, in particular, to R. Healey, who have read the 
text and counteracted our numerous petty attacks against the English language; 
those clumsy expressions that remain are, of course, our own. 

There are many more people who deserve our thanks: coauthors of our research 
papers, colleagues with whom we have had the pleasure of exchanging ideas, and 
simply friends who have supported us during difficult times. We should not forget 
about students in our courses who have helped just by asking questions; some of 
them have now become our colleagues. In view of the book complex history, the 
list should be very long. We prefer to thank all of them anonymously. However, 
since every rule should have an exception, let us name J. Dittrich, who read the 
manuscript and corrected numerous mistakes. Last but not least we want to thank 
our wives, whose patience and understanding made the writing of this book possible. 


Pavel Exner 
Miloslav Havliéek 


Chapter 1 


Some notions from functional 
analysis 


1.1 Vector and normed spaces 


The notion of a vector space is obtained by axiomatization of the properties of 
the three-dimensional space of Euclidean geometry, or of configuration spaces of 
classical mechanics. A vector (or linear) space V is a set {x,y,...} equipped 
with the operations of summation, [z,y] » x+y € V, and multiplication by a 
complex or real number a, [a,x] ++ ax € V, such that 


(i) the summation is commutative, 7+ y= y+, and associative, (c+y)+z= 
x+(y+z). There exist a zero element 0 € V, and an inverse element —z € V, 
toany x €V sothat x+0=~2 and x+(—z) =0 holds for all reV, 


(Dealir) = (of)c end lo—2., 


(iii) the summation and multiplication are distributive, a(z + y) =axr+ay and 
(a+ f~)zr=axr+ Br. 


The elements of V are called vectors. The set of numbers (or scalars) in the defini- 
tion can be replaced by any algebraic field F’. Then we speak about a vector space 
over F’, and in particular, about a complez and real vector space for F = C, R, 
respectively. A vector space without further specification in this book always means 
a complex vector space. 


1.1.1 Examples: (a) The space €” consists of n-tuples of complex numbers with 
the summation and scalar multiplication defined componentwise. In the same 
way, we define the real space JR”. 


(b) The space £, 1 < p < 00, of all complex sequences X := {€;}92, such that 
D521 IE? < 00 for p < oo and sup; |g;| < oo if p= oo, with the summation 
and scalar multiplication defined as above; the Minkowski inequality implies 
X+Y ef for X,Y € & (Problem 2). 
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(c) The space C(J) of continuous complex functions on a closed interval J C IR 
with (af+g)(x) := af(xz)+ (x). In asimilar way, we define the space C(X) 
of continuous functions on a compact X and spaces of bounded continuous 
functions on more general topological spaces (see the next two sections). 


A subspace L C V is a subset, which is itself a vector space with the same 
operations. A minimal subspace containing a given subset M C V is called the 
linear hull (envelope) of M and denoted as Mi, or lin(M). Vectors 21,..-,%n € V 
are linearly independent if a,2; +... + QnZn = 0 implies that all the numbers 
Q1,..-,Q, are zero; otherwise they are linearly dependent, which means some of 
them can be expressed as a linear combination of the others. A set M C V is 
linearly independent if each of its finite subsets consists of linearly independent 
vectors. 

This allows us to introduce the dimension of a vector space V as a maximum 
number of linearly independent vectors in V. Among the spaces mentioned in 
Example 1, €" and JR” are n-dimensional (€” is 2n—-dimensional as a real vector 
space) while the others are infinite-dimensional. A basis of a finite-dimensional 
V is any linearly independent set B C V such that Bryn = V ; it is clear that 
dimV = n iff V has a basis of n elements. Vector spaces V, V’ are said to 
be (algebraically) isomorphic if there is a bijection f : V — V’, which is linear, 
f(ax+y) =af(x) + f(y). Isomorphic spaces have the same dimension; for finite- 
dimensional spaces the converse is also true (Problem 3). 

There are various ways to construct new vector spaces from given ones. Let us 
mention two of them: 


(i) If Vi,...,Vy are vector spaces over the same field; then we can equip the 
Cartesian product V := V, x--- x Vy with a summation and scalar multipli- 
cation defined by a[x,,...,2n]+[y1,-.., yn] := (ati +y1,...,a2y+yn]. The 
axioms are obviously satisfied; the resulting vector space is called the direct 
sum of Vi,...,Vn and denoted as Vi @--- @ Vy or 3 V;. The same term 
and symbols are used if Vi,...,Vx are subspaces of a given space V such 
that each x € V has a unique decomposition x= 2, +---+2n, x; € Vj. 


(ii) If W is a subspace of a vector space V , we can introduce an equivalence 
relationon V by z~ y if z—y € W. Defining the vector-space operations on 
the set V of equivalence classes by a€+9 := (ax+y) forsome x € %, ye 7%, 
we get a vector space, which is called the factor space of V with respect to 
W and denoted as V/W. 


1.1.2 Example: The space £?(M,du), p > 1, where jz is a non-negative measure, 
consists of all measurable functions f : M — € satisfying fy,|f|Pdu < oo with 
pointwise summation and scalar multiplication — cf. Appendix A.3. The subset 
Lo C L? of the functions such that f(x) = 0 for almost all x € M is easily 
seen to be a subspace; the corresponding factor space L?(M,dy) := C?(M, du) /Lo 
is then formed by the classes of —equivalent functions. 
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Amap f : V + €ona vector space V is called a functional; if it maps into the 
reals we speak about a real functional. A functional f is additive if f(a+y) = f(x)+ 
f(y) holds for all x,y € V, and homogeneous if f(ax) = af (x) or antihomogeneous 
if f(ar) =af(x) for r€ V,a EC. An additive (anti)homogeneous functional is 
called (anti)linear. A real functional p is called a seminormif p(aty) < p(x)+p(y) 
and p(axr) = |a|p(x) holds for any z,y € V, a € C; this definition implies that p 
maps V into R* and |p(x)—p(y)| < p(x—y). The following important result is 
valid (see the notes to this chapter). 


1.1.3 Theorem (Hahn-Banach): Let p be a seminorm on a vector space V. Any 
linear functional fo defined on a subspace Vo C V and fulfilling | fo(y)| < p(y) for 
all y € Vo can be extended to a linear functional f on V such that |f(x)| < p(x) 
holds for any rE V. 


A map F:=Vx---xV—-C is called a form, in particular, a real form if its 
range is contained in R. A form F:VxV —C is bilinear if it is linear in both 
arguments, and sesquilinear if it is linear in one of them and antilinear in the other. 
Most frequently we shall drop the adjective when speaking about sesquilinear forms; 
we shall use the “physical” convention assuming that such a form is antilinear in 
the left argument. For a given F' we define the quadratic form (generated by F') by 
dr : Qr(x) = F(zx,x) ; the correspondence is one-to-one as the polarization formula 


F(z,y) = j(ar(o+y) ~ qr(z-y)) - + (ae(o-+iy) — qr(x—iy)) 


shows. A form F is symmetric if F (x,y) = F(y,x) for all z,y € V; it is positive 
if gr(x) > 0 for any x € V and strictly positive if, in addition, F(x) = 0 holds 
for x = 0 only. A positive form is symmetric (Problem 6) and fulfils the Schwarz 
inequality, 
|F(x,y)|? < ar(z)ar(y). 

A norm on a vector space V is a seminorm || - || such that ||x|] = 0 holds 
iff x =0. A pair (V,||- ||) is called a normed space; if there is no danger of 
misunderstanding we shall speak simply about a normed space V . 


1.1.4 Examples: (a) In the spaces €” and JR”, we introduce 


zd 1/p 
i) »P & i 
=) 


for x = {&,...,€n} ; the norm ||- |/2 on JR” is often also denoted as |- |. 
Analogous norms are used in £? (see also Problem 8). 


IZ lloo = max |€;| and \|z\|p = ( 


lsjsn 


cy 


(b) In L?(M, dy) , we introduce 


ile = (f, lePdu) 
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The relation ||f||,>=0 implies f(xz)=0 p-a.e. in M,so f is the zero element 
of L?(M,du). If we speak about L?(M,dy) as a normed space, we always 
have in mind this natural norm though it is not, of course, the only possibility. 
If the measure p is discrete with countable support, L?(M,d) is isomorphic 
to €? and we recover the norm ||- ||p of the previous example. 


(c) By L®(M,d) we denote the set of classes of w-equivalent functions f : 
M — C€, which are bounded a.e., i.e., there is c > 0 such that |f(x)| <c 
for p-almost all x € M. The infimum of all such numbers is denoted as 
supess|f(z)|. We can easily check that L°(M,du) is a vector space and 
fr |lf illo = sup ess -cyy|f(z)| is a norm on it. 


(d) The space C(X) can be equipped with the norm || f|l.o := supzex |f(z)|- 
A strictly positive sesquilinear form on a vector space V is called an inner (or 


scalar) product. In other words, it is a map (-,-) from V x V to € such that 
the following conditions hold for any z,y,z€V andaeC: 


(i) (t,ay+z) =a(z,y) + (z,z), 
(ii) (2,y) =(y,2), 
@i) (px) 0 anda(ays)=0 ifume0. 


A vector space with an inner product is called a pre-Hilbert space. Any such 
space is at the same time a normed space with the norm ||z|| := ,/(#,xz) ; the 
Schwarz inequality then assumes the form 


I(z,y)| < [all ty. 


The above norm is said to be induced by the inner product. Due to conditions (i) 
and (ii) it fulfils the parallelogram identity, 


cn+yll? + lao—yll? = 2l|x|)? + lly)? ; 


on the other hand, it allows us to express the inner product by polarization, 


(2,y) = (lle-+ul? — evi?) — £ (\jo-biull? — heigl). 


These properties are typical for a norm induced by an inner product (Problem 11). 

Vectors x,y of a pre-Hilbert space V are called orthogonal if (x,y) =0. A 
vector x is orthogonal to a set M if (x,y) =0 holds for all y € M ; the set of all 
such vectors is denoted as M+ and called the orthogonal complement to M. Inner- 
product linearity implies that it is a subspace, (M+),in = M+, with the following 
simple properties 


(Min)* = M+, Min c(M+)+, McN = Mton-, 
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A set M of nonzero vectors whose every two elements are orthogonal is called an 
orthogonal set; in particular, M is orthonormal if ||x|| = 1 for each sr € M. Any 
orthonormal set is obviously linearly independent, and in the opposite direction we 
have the following assertion, the proof of which is left to the reader. 


1.1.5 Theorem (Gram-Schmidt): Let N be an at most countable linearly inde- 
pendent set in a pre-Hilbert space V , then there is an orthonormal set M Cc V of 
the same cardinality such that Min = Nun. 


1.2 Metric and topological spaces 


A metric ona set X isa map 9: X x X — (0,00), which is symmetric, o(x, y) = 
o(y,z), o(x,y) =0 iff x=y, and fulfils the triangle inequality, 


o(x,z) < o(z,y) + oly, 2), 


for any x,y,z € X ; the pair (X,@) is called a metric space (we shall again for 
simplicity often use the symbol X only). If X is a normed space, one can define 
a metric on X by o(z,y) := ||z—y]] ; we say it is induced by the norm (see also 
Problems 15, 16). 

Let us first recall some basic notions and properties of metric spaces. An e- 
neighborhood of a point x € X is the open ball U.(z) := {y € X : o(y,x) < e}. 
A point x is an interior point of aset M if there isa U.(x) C M. A set is open if 
all its points are interior points, in particular, any neighborhood of a given point is 
open. A union of an arbitrary family of open sets is again an open set; the same is 
true for finite intersections of open sets. 

The closure M of a set M is the family of all points x €.X such that the 
intersection U.(z) 1M #@ for any « > 0. A point x € M is called isolated if there 
is U-(x) such that U.(z) 1M = {x}, otherwise x is a limit (or accumulation) point 
of M. The closure points of M which are not interior form the boundary bd M 
of M. A set is closed if it coincides with its closure, and M is the smallest closed 
set containing M (cf. Problem 17). In particular, the whole X and the empty set 
@ are closed and open at the same time. 

A set M is said to be dense ina set NC X if MD N;; it is everywhere dense 
if M =X and nowhere dense if X \ M is everywhere dense. A metric space which 
contains a countable everywhere dense set is called separable. An example is the 
space €” with any of the norms of Example 1.1.4a where a dense set is formed, 
e.g., by n-tuples of complex numbers with rational real and imaginary parts; other 
examples will be given in the next chapter (see also Problem 18). 

A sequence {tn} C X converges to a point x € X if to any U(x) there is 
mo such that x, € U,(x) holds for all n > no. Since any two mutually different 
points x,y € X have disjoint neighborhoods, each sequence has at most one limit. 
Sequences can also be used to characterize closure of a set (Problem 17). 

Next we recall a few notions related to maps f : X — X' of metric spaces. The 
map f is continuous at a point x € X if to any UL(f(x)) there is a Us(x) such 
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that f(Us(x)) C U!(f(x)) ; alternatively we can characterize the local continuity 
using sequences (Problem 19). On the other hand, f is (globally) continuous if the 
pull-back f'-"(G’) of any open set G’ C X' is openin X. 

An important class of continuous maps is represented by homeomorphisms, 1.e., 
bijections f : X — X’ such that both f and f~’ are continuous. It is clear that 
in this way any family of metric spaces can be divided into equivalence classes. A 
homeomorphism maps, in particular, the family + of open sets in X bijectively 
onto the family 7’ of open sets in X'; we say that homeomorphic metric spaces are 
topologically equivalent. Such spaces can still differ in metric properties. As an ex- 
ample, consider the spaces IR and (—%, 5) with the same metric 0(z,y) := |r—y| ; 
they are homeomorphic by z + arctanz but only the first of them contains un- 
bounded sets. A bijection f := X — X’ which preserves the metric properties, 
o'( f(x), f(y)) = o(x, y), is called isometry; this last named property implies conti- 
nuity, so any isometry is a homeomorphism. 

A homeomorphism f : V — V’ of normed spaces is called linear homeomor- 
phism if it is simultaneously an isomorphism. Linearly homeomorphic spaces there- 
fore also have the same algebraic structure; this is particularly simplifying in the 
case of finite dimension (Problem 21). In addition, if the identity || f(x)||v- = |lz|lv 
holds for any x € V we speak about a linear isometry. 

A sequence {Zn} in a metric space X is called Cauchy if to any € > 0 there 
is ne such that 0(%n,2m) < € for all n,m > n,. In particular, any convergent 
sequence is Cauchy; a metric space in which the converse is also true is called 
complete. Completeness is one of the basic “nontopological” properties of metric 
spaces: recall the spaces IR and (—5, 5) mentioned above; they are homeomorphic 
but only the first of them is complete. 


1.2.1 Example: Let us check the completeness of L?(M,du),p > 1, with ao-finite 
measure 1. Suppose first 44(M) < oo and consider a Cauchy sequence {f,} C L?. 
By the Holder inequality, it is Cauchy also with respect to || - ||; , so for any ¢ > 0 
there is N(€) such that ||f,z—fmlli < € for n,m > N(e). We pick a subsequence, 
9n ?= fk, , by choosing k; := N(2-') and kai = max{ka+1, N(2-"-))}, so 
I|Gn+1—Jnll1 < 2-", and the functions yp := |gi| + D221 |ge+1—ge| obey 


n—1 
[ead < Ng + S026 < 1+ [gull 
t=1 


Since they are measurable and form a nondecreasing sequence, the monotone— 
convergence theorem implies existence of a finite limpoo Yn(x) for y-aa. ce M. 
Furthermore, |9n4p—9n| < Yn+p—Yn, SO there is a function f which is finite p-a.e. 
in M and fulfils f(x) = limptoo gn(x). The sequence {g,} has been picked from a 
Cauchy sequence and it is therefore Cauchy also, |\gn—gmllp < € for all n,m > N(e) 
for a suitable N(e). On the other hand, limp. \9n()—Gm(x)|? = lgn(x) — f(x) |? 
for p-a.a. x € M, so Fatou’s lemma implies ||gn—f|lp < € a all n > N(e) ; hence 
f eT and — lfn—f\lp = 0 (Problem 24). 
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If w is o-finite and 4(M) = oo, there is a disjoint decomposition U2, M; = M 
with p(M;) < co. The already proven completeness of L?(M;, du) nit the 
existence of functions f) € L?(M;,du) which fulfil ||f9—f% ||, +0 as n— co; 
then we can proceed as in the proof of completeness of & (cf. Problem 23). 


Other examples of complete metric spaces are given in Problem 23. Any metric 
space can be extended to become complete: a complete space (X’, 0’) is called the 
completion of (X,) if (i) X C X’ and o'(x,y) = o(z,y) for all x,y € X , and (ii) 
the set X is everywhere dense in X’ (this requirement ensures minimality — cf. 
Problem 25). 


1.2.2 Theorem: Any metric space (X,g) has a completion. If (X, 6) is another 
completion of (X, 0), there is an isometry f : X’ > X which preserves X, ie., 
f(z)=a2 forall re xX. 


Sketch of the proof: Uniqueness follows directly from the definition. Existence is 
proved constructively by the so-called standard completion procedure which genera- 
lizes the Cantor construction of the reals. We start from the set of all Cauchy 
sequences in (X, 9). This can be factorized if we set {z;} ~ {y;} for the sequences 
with lim; 0(xz;,y;) = 0. The set of equivalence classes we denote as X* and 
define 0*(([z], [y]) := limjio o(x;,y;) to any [a], [y] € X*. Finally, it is necessary to 
check that this definition makes sense, 7.e., that o* does not depend on the choice 
of sequences representing the classes [z],[y], o* is a metricon X*, and (X*, 0°) 
satisfies the requirements of the definition. {f 


The notion of topology is obtained by axiomatization of some properties of 
metric spaces. Let X be a set and 7 a family of its subsets which fulfils the 
following conditions (topology axioms): 


(t1) X Er and Jer, 
(t2) if J is any index set and G, €7 for all a€ J; then Uges Ga ET, 
(t3) M1 G; €7 for any finite subsystem {G,...,Gn} Cr. 


The family 7 is called a topology, its elements open sets and the set X equipped 
with a topology is a topological space; when it is suitable we write (X,7). 

A family of open sets in a metric space (X,e) is a topology by definition; 
we speak about the metric-induced topology T,, in particular, the norm-induced 
topology if X is a vector space and g is induced by a norm. On the other hand, 
finding the conditions under which a given topology is induced by a metric is a 
nontrivial problem (see the notes). Two extreme topologies can be defined on any 
set X : the discrete topology Ta := 2% , i.e., the family of all subsets in X , and the 
trivial topology 7 := {0,X}. The first of them is induced by the discrete metric, 
oa(x,y) := 1 for x#y, while (X,70) is not metrizable unless X is a one-point set. 

An open set in a topological space X containing a point x oraset MCX is 
called a neighborhood of the point X or the set M,, respectively. Using this concept, 
we can adapt to topological spaces most of the “metric” definitions presented above, 
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as well as some simple results such as those of Problems 17a,c, 19b, topological 
equivalence of homeomorphic spaces, etc. On the other hand, equally elementary 
metric-space properties may not be valid in a general topological space. 


1.2.3 Example: Consider the topologies Tyin and Tcoun. on X = [0,1] in which 
the closed sets are all finite and almost countable subsets of X , respectively. If 
{z,} C X is a simple sequence, tp#%m for n#m; then any neighborhood U(z) 
contains all elements of the sequence with the exception of a finite number; hence 
the limit is not unique in (X,7yin). This is not the case in (X,Tcount) but there 
only very few sequences converge, namely those with zp =n, for all n > mo, which 
means, in particular, that we cannot use sequences to characterize local continuity 
or points of the closure. 


Some of these difficulties can be solved by introducing a more general notion of 
convergence. A partially ordered set J is called directed if for any a,@ € J there 
is y € I such that a < y and 8 < y. A map of a directed index set J into 
a topological space X, @ +> Zq, is called a net in X. A net {xq} is said to 
converge to a point x € X if to any neighborhood U(x) there is an ap € J such 
that xr. € U(x) for all a > ao. To illustrate that nets in a sense play the role that 
sequences played in metric spaces, let us mention two simple results the proofs of 
which we leave to the reader (Problem 29). 


1.2.4 Proposition: Let (X,7), (X’,7’) be topological spaces; then 


(a) a point x € X belongs to the closure of a set M C X iff there is a net 
{ta} C M such that 2-72, 


(b) a map f : X — X’ is continuous at a point x € X iff the net {f(xrQ)} 
converges to f(x) for any net {rq} converging to 2. 


Two topologies can be compared if there is an inclusion between them, 7; C 72 , 
in which case we say that 7 is weaker (coarser) than 72 ; while the latter is stronger 
(finer) than 7,. Such a relation between topologies has some simple consequences 
— see, €.g., Problem 32. In particular, continuity of amap f : X — Y is preserved 
when we make the topology in Y weaker or in X stronger. In other cases it may 
not be preserved; for instance, Problem 3.9 gives an example of three topologies, 
Tw CT C Ty, ona set X := B(H) and a map f : X — X which is continuous 
with respect to T and 7, but not 7,. 


1.2.5 Example: A frequently used way to construct a topology on a given X 
employs a family F of maps from X to a topological space (X,7). Among all 
topologies such that each f € F is continous there is one which is the weakest; its 
existence follows from Problem 30, where the system S consists of the sets f(-»)(G) 
for each G C7, f € F. We call this the F-weak topology. 
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For any set M in a topological space (X,7) we define the relative topology Tu 
as the family of intersections MMG with G C 7; the space (M,7y) is called 
a subspace of (X,7). Other important notions are obtained by axiomatization of 
properties of open balls in metric spaces. A family B C 7 is called a basis of a 
topological space (X,7) if any nonempty open set can be expressed as a union of 
elements of B. A family B, of neighborhoods of a given point x € X is called a 
local basis at x if any neighborhood U(z) contains some B € B,. A trivial example 
of both a basis and a local basis is the topology itself; however , we are naturally 
more interested in cases where bases are rather a “small part” of it. It is easy to see 
that local bases can be used to compare topologies. 


1.2.6 Proposition: Let a set X be equipped with topologies 7, r’ with local bases 
B,, Bi, at each x € X. The inclusion 7 C 7’ holds iff for any B € B, there is 
B’ € Bi such that B/C B. 


To be a basis of a topology or a local basis, a family of sets must meet certain 
consistency requirements (cf. Problem 30c,d); this is often useful when we define a 
particular topology by specifying its basis. 


1.2.7 Example: Let (X;,7;), 7 = 1,2, be topological spaces. On the Cartesian 
product X; x X2 we define the standard topology Tx,xx, determined by 7;, 7 = 
1,2, as the weakest topology which contains all sets G; x G2 with G; € 7;, ie., 
TX,xX_ = 7(T1 X72) in the notation of Problem 30b. Since (Ai x A2)N(Bi x Be) = 
(A, N By) x (Az Be), the family 7 x 72 itself is a basis of 7x,xx, ; a local basis 
at [x1,22] consists of the sets U(x1) x V(re), where U(x1) € 1, V(x2) € 7 are 
neighborhoods of the points x;,%2, respectively. The space (X; x Xo, Tx,xx,) is 
called the topological product of the spaces (X;,7;), 7 = 1,2. 


Bases can also be used to classify topological spaces by the so-called countability 
axioms. A space (X,7T) is called first countable if it has a countable local basis at 
any point; it is second countable if the whole topology 7 has a countable basis. The 
second requirement is actually stronger; for instance, a nonseparable metric space 
is first but not second countable (cf. Problem 18; some related results are collected 
in Problem 31). The most important consequence of the existence of a countable 
local basis, {U,(x): n = 1,2,...} C 7, is that one can pass to another local basis 
{Vn(x): mn =1,2,...}, which is ordered by inclusion, Vz41; C Vn, setting V; := U; 
and Va41 := Va Un41. This helps to partiallly rehabilitate sequences as a tool in 
checking topological properties (Problem 33a). 

The other problem mentioned in Example 3, namely the possible nonuniqueness 
of a sequence limit, is not related to the cardinality of the basis but rather to the 
degree to which a given topology separates points. It provides another classification 
of topological spaces through separability axioms: 


T, toany z,y€ X, r#y, there is a neighborhood U(x) such that y ¢ U(a), 


T2 toany z,ye€ X, r#y, there are disjoint neighborhoods U(x) and U(y), 
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T; toany closed set F C X andapoint x ¢ F,, there are disjoint neighborhoods 
U(x) and U(F), 


T, to any pair of disjoint closed sets F, F’, there are disjoint neighborhoods 
U(F) and U(F’). 


A space (X,7T) which fulfils the axioms T, and T; is called T;-space, T2-spaces 
are also called Hausdorff , T3-spaces are regular, and T,-spaces are normal. For 
instance, the spaces of Example 3 are T, but not Hausdorff ; one can find examples 
showing that the whole hierarchy is nontrivial (see the notes). In particular, any 
metric space is normal. The question of limit uniqueness that we started with is 
answered affirmatively in Hausdorff spaces (see Problem 29). 


1.3 Compactness 


One of the central points in an introductory course of analysis is the Heine-Borel the- 
orem, which claims that given a family of open intervals covering a closed bounded 
set F C IR, we can select a finite subsystem which also covers F. The notion of 
compactness comes from axiomatization of this result. Let M be a set in a topo- 
logical space (X,7). A family P := {M, : a € I} C 2% is a covering of M if 
Uaer Ma D M ; in dependence on the cardinality of the index set J the covering 
is called finite, countable, etc. We speak about an open covering if P C r. The 
set M is compact if an arbitrary open covering of M has a finite subsystem that 
still covers M ; if this is true for the whole of X we say that the topological space 
(X,7) is compact. It is easy to see that compactness of Wf is equivalent to com- 
pactness of the space (M,7,,) with the induced topology, so it is often sufficient to 
formulate theorems for compact spaces only. 


1.3.1 Proposition: Let (X,7) be a compact space, then 
(a) any infinite set M C X has at least one accumulation point, 
(b) any closed set F C X is compact, 
(c) ifamap f: (X,7) — (X',7’) is continuous, then f(X) is compact in (X’,7’). 


Proof: To check (a) it is obviously sufficient to consider countable sets. Suppose 
M = {tn : n=1,2,...} has no accumulation points; then the same is true for the 
sets My := {tn : n > N}. They are therefore closed and their complements form 
an open covering of X with no finite subcovering. Further, let {G.} be an open 
covering of F' ; adding the set G := X \ F we get an open covering of X. Any 
finite subcovering G of X is either contained in {G,} or it contains the set G ; 
in the latter case G\ G is a finite covering of the set F'. Finally, the last assertion 
follows from the appropriate definitions. 
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Part (a) of the proposition represents a particular case of a more general result 
(see the notes) which can be used to define compactness; another alternative defini- 
tion is given in Problem 36. Compactness has an important implication for the way 
in which the topology separates points. 


1.3.2 Theorem: A compact Hausdorff space is normal. 


Proof: Let F,R be disjoint closed sets and y € R. By assumption, to any x € F 
one can find disjoint neighborhoods U,(r) and Uz(y). The family {U,(x) : 2 € F} 
covers the set F', which is compact in view of the previous proposition; hence there 
is a finite subsystem {U,(z;) : 7 = 1,...,n} such that U,y(F) := Uji, Uy(z;) is a 
neighborhood of F’. Moreover, U(y) := (Vj, Uz,(y) is a neighborhood of the point 
y and U(y)NU,(F) =@. This can be done for any point y € R giving an open 
covering {U(y): y € R} of the set R ; from it we select again a finite subsystem 
{U(yx) : k = 1,...,m} such that U(R) := URL, U(ye) is a neighborhood of R 
which has an empty intersection with U(F) := (yi, Uy,(F). 


1.3.3 Theorem: Let X be a Hausdorff space, then 
(a) any compact set F Cc X is closed, 


(b) if the space X is compact, then any continuous bijection f : X — X’ for xX’ 
Hausdorff is a homeomorphism. 


Proof: If y ¢ F, the neighborhood U(y) from the preceding proof has an empty 
intersection with F', so y ¢ F. To prove (b) we have to check that f(F) is closed 
in X’ for any closed F Cc X ; this follows easily from (a) and Proposition 1c. § 


A set M in a topological space is called precompact (or relatively compact) if 
M is compact. A space X is locally compact if any point x € X has a precompact 
neighborhood; it is o-compact if any countable covering has a finite subcovering. 

Let us now turn to compactness in metric spaces. There, any compact set is 
closed by Theorem 3 and bounded — from an unbounded set we can always select an 
infinite subset which has no accumulation point. However, these conditions are not 
sufficient. For instance, the closed ball $,(0) in £? is bounded but not compact: its 
subset consisting of the points X; := {6j.}%2,,j = 1,2,..., has no accumulation 
point because ||X;—X¢|| = V2 holds for all 7 #k. 

To be able to characterize compactness by metric properties we need a stronger 
condition. Given a set M in a metric space (X,0) and € > 0, we call aset N- 
an é€-lattice for M if to any x € M there isa y € Ne such that o(z,y) < € 
(N. may not be a subset of M but by using it one is able to construct a 2e- 
lattice for M which is contained in M). A set M is completely bounded if it has a 
finite e-lattice for any ¢ > 0; if the set X itself is completely bounded we speak 
about a completely bounded metric space. If M is completely bounded, the same is 
obviously true for M. Any completely bounded set is bounded; on the other hand, 
any infinite orthonormal set in a pre-Hilbert space represents an example of a set 
which is bounded but not completely bounded. 
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1.3.4 Proposition: A o-compact metric space is completely bounded. A com- 
pletely bounded metric space is separable. 

Proof: Suppose that for some ¢ > 0 there is no finite e-lattice. Then X\S.(x1) #0 
for an arbitrarily chosen x, € X, otherwise {x;} would be an e-lattice for X. 
Hence there is x2 € X such that 0(z1,22) > € and we have X\(S.(x1)US.(%2)) 4 0 
etc.; in this way we construct an infinite set {z; : j = 1,2,...} which fulfils 
o(x;,2~) > € for all 7 Ak, and therefore it has no accumulation points. As for the 
second part, if N,, isa (1/n)-lattice for X , then U%, N, is a countable everywhere 
dense set. 


1.3.5 Corollary: Let X be a metric space; then the following conditions are 
equivalent: 


(i) X is compact, 
(ii) X is o-compact, 


(iii) any infinite set in X has an accumulation point. 


1.3.6 Theorem: A metric space is compact iff it is complete and completely 
bounded. 


Proof: Let X be compact; in view of Proposition 4 it is sufficient to show that it 
is complete. If {z,} is Cauchy, the compactness implies existence of a convergent 
subsequence so {rp} is also convergent (Problem 24). On the other hand, to prove 
the opposite implication we have to check that any M := {z,: n=1,2,...}C X 
has an accumulation point. By assumption, there is a finite l-lattice N, for X , 
hence there is y; € N; such that the closed ball Si(y,) contains an infinite subset 
of M. The ball S(y,) is completely bounded, so we can find a finite (1/2)-lattice 
N2 C Si(y) and a point y2 € No such that the set S)2(y2)M is infinite. In this 
way we get a sequence of closed balls S,, := So:i(y,) such that each of them contains 
infinitely many points of M and their centers fulfil yay; € S,. The closed balls 
of doubled radii then satisfy S2.(Yn41) C Sean(yn) and M has an accumulation 
point in view of Problem 26. 


1.3.7 Corollary: (a) A set M in a complete metric space X is precompact iff 
it is completely bounded. In particular, if X is a finite-dimensional normed 
space, then M is precompact iff it is bounded. 


(b) A continuous real-valued function f on a compact topological space X is 
bounded and assumes its maximum and minimum values in X . 


Proof: The first assertion follows from Problem 25. If M is compact, it is bounded 
so M is also bounded. To prove the opposite implication in a finite-dimensional 
normed space, we can use the fact that such a space is topologically isomorphic to 
€” (or JR” in the case of a real normed space — see Problem 21). As for part (b), 
the set f(X) C JR is compact by Proposition lc, and therefore bounded. Denote 
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Q := supzcx f(x) and let {z,} C X be a sequence such that f(rn) > a. Since X 
is compact there is a subsequence {z,;,,} converging to some 2x, and the continuity 
implies f(x,) = a@. In the same way we can check that f assumes a minimum 
value. 


1.4 ‘Topological vector spaces 


We can easily check that the operations of summation and scalar multiplication in 
a normed space are continuous. Let us now see what would follow from such a 
requirement when we combine algebraic and topological properties. A vector space 
V equipped with a topology 7 is called a topological vector space if 


(tvl) the summation maps continuously (Vx V, tyxv) to (V,7), 
(tv2) the scalar multiplication maps continuously (CxV, texv) to (V,7), 
(tv3) (V,7) is Hausdorff. 


In the same way, we define a topological vector space over any field. Instead of (tv3), 
we may demand 7}-separability only because the first two requirements imply that 
T3 is valid (Problem 39). 

A useful tool in topological vector spaces is the family of translations, tz : V > 
V , defined for any x € V by t(y) := x+y. Since tZ! = t_,, the continuity of 
summation implies that any translation is a homeomorphism; hence if G is an open 
set, then x+G:=t,(G) is open for all xz € V ; in particular, U is a neighborhood 
of a point x iff U =x+U(0), where U(0) is a neighborhood of zero. This allows 
us to define a topology through its local basis at a single point (Problem 40). 

Suppose a map between topological vector spaces (V,7) and (V’,7’) is simul- 
taneously an algebraic isomorphism of V, V’ and a homeomorphism of the corre- 
sponding topological spaces, then we call it a linear homeomorphism (or topological 
isomorphism). As in the case of normed spaces (cf. Problem 21), the structure of a 
finite-dimensional topological vector space is fully specified by its dimension. 


1.4.1 Theorem: Finite—-dimensional topological vector spaces (V,r) and (V’,7’) 
are linearly homeomorphic iff dim V = dimV’. Any finite~dimensional topological 
vector space is locally compact. 

Proof: It is sufficient to construct a linear homeomorphism of a given n-dimensional 
(V,7) to ©". We take a basis {e1,...,en} C V and construct f : V > @” by 
ff (Giins &e;) := [€,,...,&] ; in view of the continuity of translations we have to 
show that f and f~! are continuous at zero. According to (tv1), for any U(0) Er 
we can find neighborhoods U;(0) such that )%_,2; € U(0) for xz; € U;(0), 7 = 
1,...,n and f7! is continuous by Problem 42a. To prove that f is continuous 
we use the fact that V is Hausdorff: Proposition 1.3.1 and Theorem 1.3.3 together 
with the already proven continuity of f~! ensure that S.:= {reEV : ||f(x)|| =} 
= f‘-)(K.) is closed for any € > 0 ; we have denoted here by K. the e-sphere 
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in €™. Since 0 ¢ S-, the set G := V \ S, is a neighborhood of zero, and by 
Problem 42b there is a balanced neighborhood U Cc G of zero; this is possible only 
if ||f(x)||<e forallreU. § 


Next we want to discuss a class of topological vector spaces whose properties are 
closer to those of normed spaces. In distinction to the latter the topology in them is 
not specified generally by a single (semi)norm but rather by a family of them. Let 
P := {p, : a € I} be a family of seminorms on a vector space V where / is an 
arbitrary index set. We say that P separates points if to any nonzero x € V there 
is a Pa € P such that po(x) #0. It is clear that if P consists of a single seminorm 
p it separates points iff p is a norm. Given a family P we set 


B.(p1,.++)Pn) = {CEV : p(x) <e€,j=1,...,n} ; 


the collection of these sets for any € > 0 and all finite subsystems of P will be 
denoted as B?. In view of Problem 40, B) defines a topology on V which we 


denote as Tr”. 


1.4.2 Theorem: If a family P of seminorms on a vector space V separates points, 
then (V,7”) is a topological vector space. 

Proof: By assumption, to a pair x,y of different points there is a p € P such that 
€:= 5p(x—y) > 0. Then U(r) := 2+ B.(p) and U(y) := y + Be(p) are disjoint 
neighborhoods, so the axiom Tp) is valid. The continuity of summation at the point 
[0,0] follows from the inequality p(x+y) < p(x)+ p(y) ; for the scalar multiplication 
we use the estimate p(ax—aoz%o) < |a — ao| p(xo) + |alp(x—axo). OF 


A topological vector space with a topology induced by a family P separating 
points is called locally convex. This name has an obvious motivation: if z,y € 
B.(pi,--+,;Pn), then p;(ta+(1—t)x) < tp;(x)+(1—t)p;(x) holds for any t € (0, 1] 
so the sets B.(p),...,Pn) are convex. The convexity is preserved at translations, so 
the local basis of 7” at each x € V consists of convex sets (see also the notes). 


1.4.3 Example: The family P := {p, := |(z,-)| : 2 € V} in a pre-Hilbert 
space V generates a locally convex topology which is called the weak topology and 
is denoted as Ty ; it is easy to see that it is weaker than the “natural” topology 
induced by the norm. 


1.4.4 Theorem: A locally convex space (V,r) is metrizable iff the family P of 
seminorms which generates the topology 7 is countable. 


Proof: If V is metrizable it is first countable. Let {U; : 7 = 1,2,...} bea local 
basis of 7 at the point 0. By definition, to any U; we can find € > 0 anda 
finite subsystem P; C P such that (pep, Be(p) C U;. The family P’ :-= UR, P; 
is countable and generates a topology +” which is not stronger than t := 7? ; 
the above inclusion shows that 7?’ = r. On the other hand, suppose that 7 is 
generated by a family {p, : n = 1,2,...} separating points; then we can define 
a metric @ as in Problem 16 and show that the corresponding topology satisfies 
T =T (Problem 43). & 
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A locally convex space which is complete with respect to the metric used in the 
proof is called a Fréchet space (see also the notes). 


1.4.5 Example: The set S(JR”) consists of all infinitely differentiable functions 
f : IR" — C such that 


Ifllax == sup |2”(D* f)(2)| < c 


holds for any multi- wy aay, ONG), Koran k,, . ke )ewitheeekeenon= 
negative integers, where z7 := €]!...€/>, DK := AlKl/oeP...0¢ and |K| := 
ki +---+k,. It is easy to see that any such f and any derivative D* f (as well 
as polynomial combinations of them) tend to zero faster than |z7|~1 for each J ; 
we speak about rapidly decreasing functions. It is also clear that any ||- |lz« is a 
seminorm, with ||flloo = ||fll.., and the family P := {|| - |lzx} separates points. 
The corresponding locally convex space S(JR”) is called the Schwartz space; one 
can show that it is complete, i.e., a Fréchet space (see the notes). 

An important subspace in S(JR”) consists of infinitely differentiable functions 
with a compact support; we denote it as C§°(JR”) . It is dense, 


Ce UR") = SUR"), (1.1) 


with respect to the topology of SUR”) (Problem 44). 


1.5 Banach spaces and operators on them 


A normed space which is complete with respect to the norm—induced metrics is called 
a Banach space. We have already met some frequently used Banach spaces — see 
Example 1.2.1 and Problem 23. In view of Problem 21, any finite-dimensional 
normed space is complete; in the general case we have the following completeness 
criterion, the proof of which is left to the reader (see also Example 3b below). 


1.5.1 Theorem: A normed space V is complete iff to any sequence {zn} C V 
such that 7%, ||rnl| < co there is an x € V such that z = limpo ha Ze (or 
in short, iff any absolutely summable sequence is summable). 


Given a noncomplete norm space, we can always extend it to a Banach space by 
the standard completion procedure (Problem 46). A set M in a Banach space ¥ 
is called total if Mi, = X. Such a set is a basis if M is linearly independent and 
dim X < oo, while an infinite-dimensional space can contain linearly independent 
total sets, which are not Hamel bases of ¥ (cf. the notes to Sec.1.1). 


1.5.2 Lemma: (a) If M is total in a Banach ar *, then any set N C # dense 
in M is totalin %. 


(b) A Banach space which contains a countable total set is separable. 
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Proof: Part (a) follows from the appropriate definitions. Suppose that M = 
{x1,2o,...} is totalin 4 and G.; is the countable set of complex numbers with ra- 
tional real and imaginary parts; then the set L:= { D4, yj: Y € Cu, TLCS} 
is countable. Since Cra: is densein C,weget L=¥. Oi 


1.5.3 Examples: (a) The set P(a,6) of all complex polynomials on (a,6) is an 
infinite-dimensional subspace in C{a, b] := C({a,6]).. By the Weierstrass theo- 
rem, any f € C[a,b] can be approximated by a uniformly convergent sequence 
of polynomials; hence C[a,}] is a complete envelope of (P(a,), ||- ll). The 
set {2*: k=0,1,...} is total in C[a,}], which is therefore separable. 


(b) Consider the sequences Ex := {6jx}$2, in &,p>1. Foragiven X = {€} € 
&, the sums X, := f,¢;H; are nothing else than truncated sequences, so 
limpoo ||X—Xnllp = 0. Hence {E, : k= 1,2...} is a countable total set and 
f? is separable. Notice also that the sequence {€;E,}52, is summable but it 
may not be absolutely summable for p> 1. 


(c) Consider next the space L?(JR",du) with an arbitrary Borel measure pz on 
IR”. We use the notation of Appendix A. In particular, 7” is the family of all 
bounded intervals in JR” ; then we define S) := {yy : J € J}. It is a subset 
in L? and the elements of its linear envelope are called step functions; we can 
check that S™) is total in L?UR",dw) (Problem 47). Combining this result 
with Lemma 2 we see that the subspace Cp°(JR") is dense in LPR", dy) ; 
in particular, for the Lebesgue measure on JR” the inclusions Cf°UR") Cc 
SUR") c LPR") yield 


(CoPUR)), = (SUR")), = LPR"). (1.2) 


(d) Given a topological space (X,r) we call C,(X) the set of all continuous 
functions on X with the following property: for any € > 0 there is a compact 
set K C X such that |f(x)| <e€ outside K. It is not difficult to check that 
C..(X) is a closed subspace in C(X) and Co(X) = C,.(X), where Co(X) 
is the set of continuous functions with compact support (Problem 48). In the 
particular case X = IR”, C§°(UR") is dense in C..(IR") (see the notes), so 


(Co? CR") 0 = (SUR")).. = Coo(IR”). (1.3) 


There are various ways in, which it is possible to construct new Banach spaces 
from given ones. We mention two of them (see also Problem 49): 


(i) Let {4 : 7 = 1,2,...} be a countable family of Banach spaces. We denote 
by * the set of all sequences x := {xj}, x; € Xj, such that ¥; |la;||j < 00, 
and equip it with the “componentwise” defined summation and scalar mul- 
tiplication. The norm ||X||e := ¥; ||z,||; turns it into a Banach space; the 
completeness can be checked as for £? (Problem 23). The space (4, ||: |l) is 
called the direct sum of the spaces X;, 7 = 1,2,..., and denoted as DD Xj . 


1.5. BANACH SPACES AND OPERATORS ON THEM i 


(ii) Starting from the same family {4% : 7 = 1,2,...}, one can define another 
Banach space (which is sometimes also referred to as a direct sum) if we change 
the above norm to ||X||.. := sup; ||z;||; replacing, of course, ¥ by the set of 
sequences for which ||X||.. < oo. The two Banach spaces are different unless 
the family {4%;} is finite; the present construction can easily be adapted to 
families of any cardinality. 


A map B : Vi — V2 between two normed spaces is called an operator; in 
particular, it is called a linear operator if it is linear. In this case we conventionally 
do not use parentheses and write the image of a vector x € V; as Bz. In this book 
we shall deal almost exclusively with linear operators, and therefore the adjective 
will usually be dropped. A linear operator B : V; — V2 is said to be bounded if 
there is a positive c such that ||Bz|l2 < cz]; for all « € VY ; the set of all such 
operators is denoted as B(V,, V2) or simply B(V) if Vi = Vo := V. One of the 
elementary properties of linear operators is the equivalence between continuity and 
boundedness (Problem 50). 

The set B(V,,V2) becomes a vector space if we define on it summation and 
scalar multiplication by (aB + C)x := aBxr +Czx. Furthermore, we can associate 
with every B € B(V;, V2) the non-negative number 


||Bl| <= sup ||Bzlle, 
Si 


where S; :={xeEV,: |\xl]1 =1} is the unit sphere in V; (see also Problem 51). 


1.5.4 Proposition: The map B+ ||B|| isanormon B(V;, V2). If V2 is complete, 
the same is true for B(V;, V2), i.e., it is a Banach space. 

Proof: The first assertion is elementary. Let {B,} be a Cauchy sequence in 
B(V,,V2) ; then for all n,m large enough we have ||B,—Bm|| < €, and there- 
fore ||B,r—Bmzll2 < ellz||1 for any 2 € V,. As a Cauchy sequence in V2, {B,x} 
converges to some B(x) € V2. The linearity of the operators B, implies that 
z+ Ba is linear, B(x) = Br. The limit m — oo in the last inequality gives 
||Bx—B,x\\o < e|lz||1, 80 B € B(V,, V2) by the triangle inequality, and ||B—B,|| < € 
for all n large enough. & 


The norm on B(V;,V2) introduced above is called the operator norm. It has an 
additional property: if C : V; —~ Vo and B: V2 — V3 are bounded operators, 
and BC is the operator product understood as the composite mapping V; > V3, 
we have ||B(Cz)||3 < ||Bll ||Czll2 < ||BI ||Cll Ilzl]1 for all 2 € Vj, so BC is also 
bounded and 
|BC|| < ||BI|ICl. f (1.4) 
Let V, be a subspace of a normed space V,. An operator B: V; — Vo is called 
a restriction of B: V; — V2 to the subspace V, if Ba = Bx holds for all x € V;, 
and on the other hand, B is said to be an ertension of B ; we write B = B)} V, or 
BC B. Another simple property of bounded operators is that they can be extended 
uniquely by continuity. 
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1.5.5 Theorem: Assume that 4, ¥2 are Banach spaces and V, is a dense sub- 
space in 4 ; then any B € B(V;, X2) has just one extension B € B(4,, X92), and 
moreover, ||B|| = ||Bll. 

Proof: For any x € X we can find a sequence {z,} C V, that converges to x. 
Since B is bounded, the sequence {B2x,} is Cauchy, so there isa y € 2 such that 
||Brn—yll2 > 0. We can readily check that y does not depend on the choice of 
the approximating sequence and the map z+ y is linear; we denote it as B. If 
x € V, one can choose z, = x for all n, which means BhVY,=B. Passing to the 
limit in the relation ||Bzn|l2 < ||B]l lat we get B € B(X,, X2) and ||Bi| < ||BI| ; 
since B is an extension of B the two norms must be equal. Suppose = that 
B=C}V, for some C € B(A\, X%2). We have Cipg= Bt, , =A, 2). .@homemy 
approximating sequence, and therefore C= B. Uf 


Notice that in view of Proposition 4, B(V,,4’) is complete even if V; is not. The 
approximation procedure used in the proof to define B is often of practical impor- 
tance, namely if we study an operator whose action on some dense subspace is given 


by a simple formula. 


1.5.6 Example (Fourier transformation): Following the usual convention we denote 
the scalar product of the vectors x,y € IR” by x-y and set 


fy) = ny"? [et f(z)dz and f(y) = f(-y) (1.5) 


for any f € SUR") and y € JR”. The function f is well- defined and one can check 
that it belongs to S(IR*) (Problem 52), i.e., that Fo : Fof = f isa linear map of 

S(R") onto itself. We want to prove that la: f  f is its inverse. To this end, 
we use the relation from Problem 52; choosing g, := e~®’!!"/?: we get 


ix-y—e|2|2/2 F = —|z|?/2 
ae € F(y) dy i. e f(x+ez) dz 


a any € > 0. The two integrated functions can be majorized independently of 
; the limit « > 0+ then yields AyFof = f. Since f(r) = ie x) we also get 
Fefof = f for all f € SUR"), i.e., the relation 


(Fo'f)(x) = (Fof)(-2). 


Using Theorem 5, we shall now construct two important extensions of the operator 
Fo. For the moment, we denote by S,(JR") the normed space (S(IR"), || - ||p) ; we 
know from (1.2) that it is dense in L?UR"), p> 1. 


(i) Since SUR") is a subset of C.(IR") and || filo < (27)-"/2|| fl, , the operator 
Fo can also be understood as an element of B(S\(IR"), C.(UR")). As such 
it extends uniquely to the operator F € B(L'UR"), C,.(UR")) ; it is easy to 
check that its action on any f € L'(JR") can be expressed again by the first 
one of the relations (1.5), 


(FA) uy) = (2ny*? fo e**¥F(0) dr. 
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The function Ff is called the Fourier transform of f. We have Ff € 
C..(UR") , and therefore 
ae (Foy u0 


this relation is often referred to as the Riemann-Lebesgue lemma.. 


(ii) Using once more the relation from Problem 52, now with g := f= f, we find 


[,.\Fo)Pay =f \ftu)Pdy 


for any f € S(UJR"). This suggests another possible interpretation of the 
operator Fo as an element of B(S2(IR”), L?(JR")). Extending it by continuity, 
we get the operator F € B(L?(JR")), which is called the Fourier-Plancherel 
operator or briefly F'P-operator; if it is suitable to specify the dimension of 
IR” we denote it as F,,. The above relation shows that ||F fello = || fxll2 holds 
for the elements of any sequence {fx} C Sp approximating a given f € L’, 
and therefore 
FP flle = Iflle 


for all f € L?(UR") ; this implies that F is surjective (Problem 53). Hence 
the Fourier—Plancherel operator is a linear isometry of L?(IR") onto itself. 


We are naturally interested in how F' acts on the vectors from L?\S. There 
is a simple functional realization for n = 1 (see Example 3.1.6). In the general 
case, the right sides of the relations (1.5) express (F'f)(y) and (F-'f)(y) as 
long as f € L*M L}. To check this assume first that supp f C J, where J is 
a bounded interval in JR” and consider a sequence {f,} approximating f ac- 
cording to Problem 47d. By the Hélder inequality || f—frll1 < uw(J)!/?||f—felle , 
so || f—fxl|1 + 0, and consequently, the functions Fof, converge uniformly to 
Ff. On the other hand Fof, = F fk, 80 ||Fofe — Ff lle = ll fe—f lle + 0; the 
sought expression of Ff then follows from the result mentioned in the notes 
to Section 1.2. A similar procedure can be used for a general f € L?ML!: one 
approximates it, e.g., by the functions f; := fx;, where x; are characteristic 
functions of the balls {xz € IR”: |z| <j}. 

For the remaining vectors, f € L?\ L’, the right side of (1.5) no longer makes 
sense and Ff must be defined as a limit, e.g., ||F f — Ff;|l2 +0, where f; 
are the truncated functions defined above. The last relation is often written 
as 


(FAY) = Lim. jrco(2ey"? fem ¥ 4 (2) de, 


lzl<j 


where the symbol |.i.m. (limes in medio) means convergence with respect to 
the norm of L?(JR"). 


A particular role is played by operators that map a given normed space into (. 
We call B(V,C) the dual space to V and denote it as V*; its elements are bounded 
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linear functionals. Comparing it with the algebraic dual space defined in the notes, 
we see that V* is a subspace of V/, and moreover, a Banach space with respect 
to the operator norm; the two dual spaces do not coincide unless dimV < oo 
(Problem 54). The Hahn-Banach theorem has some simple implications. 


1.5.7 Proposition: Let Vo be a subspace of a normed space V ; then 
(a) for any fo € Vv there is an f € V* such that f)Vo = fo and |lfoll = Ifill, 


(b) if Vo AV, then to any z ¢ Vp one can find f, € V* with ||f,|| = 1 such that 
fz)}Vo =0 and f,(z) = d(z) := infyey |lz—yl]. 


Proof: The first assertion follows from Theorem 1.1.3 with p= || fol || - ||. To prove 
(b), we have to check first that x++ d(x) is aseminorm on V and d(x) = 0 holds 
iff x € Vo. Then we take the subspace V; := {zr = ytaz: ye Vo,a€ C} and set 
fi(z) := ad(z), in particular, f;(z) = d(z). This is obviously a linear functional 
and |fi(y+az)| = d(az) < |ly+az||, so fi € V* and |{fi|| < 1. On the other 
hand, we have d(z) = |f;(y—z)| < |lfill ly—zll ; hence || f,|| = 1 and the sought 
functional f, is obtained by extending f, to the whole V in accordance with the 
already proven part (a). & 


1.5.8 Corollary: (a) To any nonzero xz € V there is a functional f, € V* such 
that fe(x) = ||z|| and || fel] =1. 


(b) The family V* separates points of V. 


(c) If the dual space 4* to a Banach space 4 is separable, then 4 is also 
separable. 


Proof: The first two assertions follow immediately from Proposition 7. Let further 
{fr: n= 1,2,...} bea dense set in ¥*. To any nonzero f, we can find a unit 
vector ty € V such that |fa(wa)| > || fall. In view of Lemma 2 it is sufficient 
to check that M := {z, : n = 1,2,...} is total in 4. Let us assume that 
Vo := Min # * ; then Proposition 7 implies the existence of a functional f € 4’* 
such that ||f|| = 1 and f(x) =0 for 2 € Vo. For any ¢ > 0 we can find a nonzero 
fn such that ||fr—f|| <e€, ae, || fall > 1 —e; hence we arrive at the contradictory 
conclusion € > ||fn — fll 2 |fn(@n) — f(an)| = |fa(tn)| > 3 Il fall > 70 =e ‘f 


One of the basic problems in the Banach-space theory is to describe fully 2* 
for a given 4. We limit ourselves to one example; more information can be found in 
the notes and in the next chapter, where we shall show how the problem simplifies 
when 4 is a Hilbert space. 


1.5.9 Example: The dual (¢)*, p > 1, is linearly isomorphic to @, where g := 
p/(p—1) for p> 1 and q:= oo for p=1. To demonstrate this, we define 


fy(X) = sy eM 
k=] 


1.5. BANACH SPACES AND OPERATORS ON THEM 21 


for any sequences X := {€,} € @ and Y := {nm} € €7; then the Hdlder inequality 
implies that fy is a bounded linear functional on @? and ||fy|| < ||Y\l;. The 
map Y +> fy of & to (é)* is obviously linear; we claim that this is the sought 
isometry. We have to check its invertibility. We take an arbitrary f € (@?)* and set 
Mk = f( Ex), where E;, are the sequences introduced in Example 3b; then it follows 
from the continuity of f that the sequence Y; := {m}f2, fulfils f = fy,. To show 
that Y+ € &, consider first the case p > 1. The vectors X, := R_, sgn (7) |ne|2-? , 
where sgnz := z/|z| if z #0 and zero otherwise, fulfil ||Xnllp = (RL, Inel?)” 
and f(Xn) = Vika Ime|?, so the inequality |f(Xn)| < lI fll |Xnllp yields 


oo 1/g 
& nl) SP Wallies 7 dg) sce 
k=1 


If p=1 we have |f(£,)| = |p| and ||En|l1 = 1,s0 sup, |mal < |||] . Hence in both 
cases the sequence Y; € @?, and the obtained bounds to its norm in combination 
with the inequality || fy|| < ||Yllq, which we proved above, yield ||f|| = ||Y¢llq- 


The dual space of a given V is normed, so we can define the second dual V*™* := 
(V*)* as well as higher dual spaces. For any x € V we can define J, € V* by 
Jr(f) := f(x). The map x + J, is a linear isometry of V to a subspace of 
V** (Problem 55); if its range is the whole V** the space V is called reflezive. 
It follows from the definition that any reflexive space is automatically Banach. In 
view of Example 9, the spaces @ are reflexive for p > 1, and the same is true for 
I?(M,du) (see the notes). On the other hand, ¢! and C(K) are not reflexive, 
and similarly, L'(M,d) is not reflexive unless the measure p has a finite support. 
Below we shall need the following general property of reflexive spaces, which we 
present without proof. 


1.5.10 Theorem: Any closed subspace of a reflexive space is reflexive. 


The notion of dual space extends naturally to topological vector spaces: the 
dual to (V,7) consists of all continuous linear functionals on V ; in this case we 
often denote it alternatively as V’. It allows us to define the weak topology Tw 
on V as the weakest topology with respect to which any f € V’ is continuous, or 
the V’—weak topology in the terminology introduced in Example 1.2.5 ; in the next 
chapter we shall see that the definition is consistent with that of Example 1.4.3. We 
have T, C T because each f € V’ is by definition continuous with respect to 7, 
and it is easy to check that 7, coincides with the topology generated by the family 


Pw :={ps: f €V'}, where py(x) :=|f(x)|. 

1.5.11 Proposition: If (V,7) is a locally convex space, then P, is separating 
points of V and the space (V,7 ~) is also locally convex. 

Proof: If V is a normed space, the assertion follows from Corollary 8; for the general 
case see the notes. 


If a sequence {z,} C V converges to some x € V with respect to 7, we write 
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x = w-limMpoLn OF Ln — x. Let us list some properties of weakly convergent 
sequences (only the case dim V =oo is nontrivial — see Problem 56). 


1.5.12 Theorem: (a) t, > 2 iff f(zn) > f(z) holds for any f EV". 
(b) Any weakly convergent sequence in a normed space is bounded. 


(c) If {xp} is a bounded sequence in a normed space V and g(Zn) — g(x) for all 
g of some total set in F C V*, then 2, > 2. 


Proof: The first assertion follows directly from the definition. Further, we use 
the uniform boundedness principle which will be proven in the next section: if 
In — x£, then the family {Ww} C V** with Yn(f) := f(zn) fulfils the assumptions 
of Theorem 1.6.1, which yields ||wp|| = |lznl| < c for some c > 0. As for part 
(c), we have g(Xn) — g(x) for all g € Fin. Since F is total by assumption, for 
any f € V*, e > O there is a g € Fin and a positive integer n(€) such that 
| f—gll <€ and |9(r,) — g(x)| <€ for n > n(¢) ; this easily yields |f(z,)— f(x)| < 
(1+ |lxzn|| + |lz||)e. However, the sequence {z,} is bounded, so f(tn) + f(x). Of 


1.5.13 Example (weak convergence in £?, p > 1): In view of Examples 3b and 
9, the family of the functionals fi({€;}921) :-= €, k = 1,2,..., is total in (é)*. 
This means that a sequence {X,} C @?, Xn = Ge }$2,, converges weakly to 
X = {&}32, © & iff it is bounded and e” — € for 7 =1,2,.... For instance, 
the sequence {F,,} of Example 3b converges weakly to zero; this illustrates that the 
two topologies are different because {E,,} is not norm-convergent. 


A topological vector space (V,7) is called weakly complete if any sequence 
{xn} C V such that {f(zn)} is convergent for each f € V’ converges weakly 
to some x € V. A set M C V is weakly compact if any sequence {zn} C M 
contains a weakly convergent subsequence. 


1.5.14 Theorem: Let 4 be a reflexive Banach space, then 
(a) #& is weakly complete, 


(b) aset MC is weakly compact iff it is bounded. 


Proof: (a) Let {xn} C *# be such that {f(rn)} is convergent for each f € V*. 
The same argument as in the proof of Theorem 12 implies existence of a positive c 
such that the sequence {jn} CX, Ua(f) := f(an), fulfils |Yn(f)| < ell f||, n= 
1,2,..., for all f ¢ &*. The limit ¢(f) := limp. Ya(x) exists by assumption, 
the map f +> ~(f) is linear, and the last inequality implies f € 4**. Since 2 is 
reflexive, there is an x € X such that ¥(f) = f(x) for all fe€ X*, ie, mx. 
(b) If M is not bounded there is a sequence {zn} C M such that ||a_|| > 7 ; 
then no subsequence of it can be weakly convergent. Suppose on the contrary that 
M is bounded and consider a sequence X := {zn} C M ; it is clearly sufficient 


1.6. THE PRINCIPLE OF UNIFORM BOUNDEDNESS 23 


to assume that X is simple, zp #4 2m for n # m. In view of Theorem 10, 
Y := {rn}iin is a separable and reflexive Banach space, so Y** is also separable, 
and Corollary 8 implies that )* is separable too. Let {g; : j = 1,2,...} be 
a dense set in Y*. Since X is bounded the same is true for {g(z,)} ; hence 
there is a subsequence X; := {x} such that {g:(2)} converges. In a similar 
way, {go(x\)))} is bounded, so we can pick a subsequence X2 := {x2} C X, 
such that {go(x‘?))} converges, etc. This procedure yields a chain of sequences, 
X D-+- DX; D Xj41 D--+, such that {9;(x))}2,, 7 =1,2,..., are convergent. 
Now we set yn := 2"), so yn € X; for n> Jj and {g;(yn)}%, converges for any 
j; then {g(yn)}?, is convergent for all g € Y* due to Theorem 12c. The already 
proven part (a) implies the existence of y € Y such that g(yn) — g({y) for any 
g € Y*. Finally, we take an arbitrary f € 4* and denote gy := f)). Since 
97 © Y* we have f(Yn) = 97(Yn) — o¢(y) = f(y), and therefore y, >y. Wf 


1.6 The principle of uniform boundedness 


Any Banach space + is a complete metric space so the Baire category theorem 
is valid in it (cf. Problem 27). Now we are going to use this fact to derive some 
important consequences for bounded operators on +. 


1.6.1 Theorem (uniform boundedness principle): Let F be a subset in B(Y,V,), 
where ¥ isa Banach space and (Vj, ||-||1) isa normed space. If supge¢ ||Ba||1 < co 
for any x € %, then there is a positive c such that supge¢ ||Bl| <c. 


Proof: Since any operator B € F is continuous, the sets M,:= {xe : ||Balli < 
n forall B € F} are closed. Due to the assumption, we have ¥ = Ur2, M, 
and by the Baire theorem, at least one of the sets M, has an interior point, 7.e., 
there is a natural number 7, an Z € M,, and an e€ > 0 such that all x fulfilling 
|z—Z|| < € belong to Mj, and therefore supge¢ ||Bz||1 < 7. Let ye ¥ bea unit 
vector. We set ry := §y; then 2,+Z € Mj, and 


2 4n 
é 


2 A . 
Bylli = = Baylis 2 B@y+2)hh + (Bel) < =; 


this implies ||B||< “ forall Be F. 


In what follows, +, Y are Banach spaces, U. and V;, are open balls in ¥ and 
y, respectively, of the radius ¢ > 0 centered at zero. By N° we denote the interior 
of aset N C J), i.e., the set of all its interior points. Any operator B € B(¥,y) 
is continuous, so the pull-back B‘-(G) of an open set GC ) is openin 2. If 
B is surjective, the converse is also true. 


1.6.2 Theorem (open-mapping theorem): If an operator B € B(X,)) is surjective 
and GC & is an open set, then the set BG is openin ). 


We shall first prove a technical result. 
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1.6.3 Lemma: Let B € B(X,y) and «> 0. If (BU.)° #9 or (BU.)° #0; then 
0 € (BU,)° or 0 € (BU,)°, respectively, holds for any 7 > 0. 

Proof: Let yo be an interior point of BU. ; then there is 6 > 0 such that yo+ 
V; C BU,, i.e. to any y € Vs there exists a sequence {2} C U. such that 
2) := Ba) — y+yo. In particular, 2© — yo, so 2¥—20) = y, and since 
Ila) 2) ||» < 2e we get Vs C BU... In view of Problem 33, cBU. = cBU, holds 
for any c>0. This implies that Vx C BU,, 7 := we so 0 is an interior point of 
BU,,. A similar argument can be used for (BU.)? 40. & 


Proof of Theorem 2: To any x € G we can find U, such that r+U, C G, ie., 
Ba+BU, Cc BG. If there is Vs C BU, the set BG is open; hence it is sufficient to 
check 0 € (BU,)° for any n > 0. We write ¥ = U7, Un ; since B is surjective, 
we have Y = U%, BU, and the Baire category theorem implies (BU;)° # 0 for 
some positive integer 7. We shall prove that BU; C BUog,. 

Due to the lemma, BU, contains a ball Vs, and this further implies Vs, C 
BU,, for j = 1,2,..., where 6; :-= 6/2) and np :=/2?. bet y eBUzysotemy 
neighborhood of y contains elements of BU, ; in particular, for the neighborhood 
y+Vs, we can find 2, € U,; such that Ba, € y+Vs, , and therefore also y— Ba € 
Vs, C BU, . Repeating the argument we see that there is an x2 € U,, such that 
y—Br,—Bz2 € Vs, C BU,, etc; in this way we construct a sequence {z;} C 
such that ||z;\|xy < 27/2? and 


j 
|y- Be. < 6;. 
k=1 y 
Then Theorem 1 implies the existence of limj...}.,2; =:  € X ; we have 


y = Bx because B is continuous. Now ||z|lv¥ < Re, |lzellx < 2%, s0 y € BUda ; 
this proves BU, C BUo,. Since (BU;)° # @ the set BU2g has an interior point; 
using the lemma again we find 0 € (BU,)° forany n>0. § 


Theorem 2 further implies the following often used result, the proof of which is 
left to the reader (Problem 58). 


1.6.4 Corollary (inverse-mapping theorem): If B € B(X,¥) is a bijection, then 
B~" is a continuous linear operator from ) to ¥. 


In the rest of this section, T means a linear operator defined on a subspace Dr 
of a Banach space ¥ and mapping Dr into a Banach space ) ; we do not assume 
T to be continuous. The subspace Dr is called the domain of the operator T and 
is alternatively denoted as D(T). 

The set I(T) := { [z, Tx] : c € Dr} is called the graph of the operator T; it 
is a subspace in the Banach space ¥ @ Y. In general, a subspace T C A OY is 
said to be a graph if each element [x,y] € I’ is determined by its first component. 
Any graph I determines the linear operator Tp from X¥ to Y with the domain 
D(Tr) = {xe #: [x,y] €P} by Trx := y for each [z,y] eT. It is clear that 
I'(Tr) = T° and conversely, a linear operator is uniquely determined by its graph, 
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Tyr) = T. An operator T is called closed if I'(T) is a closed set in Y @); the 
definition of the direct product gives the following equivalent expression. 


1.6.5 Proposition: An operator T is closed iff for any sequence {%,} C Dr such 
that tn > x and Tz, — y, we have x € Dr and y=Tz. 


Any bounded operator is obviously closed. On the other hand, there are closed 
operators which are not bounded (Problem 59). If ['(T) is not closed but its closure 
in X ® is a graph (which may not be true) the operator T is said to be closable 
and the closed operator 


UE Tray 
is called the closure of the operator T ; we have ['(T) = I'(T). Since I(T) is 
the smallest closed set containing ['(T) , the closure is the smallest closed extension 
of the operator JT’. Moreover, I(T) is a subspace in ¥ @ J), so it is a graph iff 
(0,y] € 1(7) implies y = 0. This property makes it possible to describe the closure 
sequentially. 


1.6.6. Proposition: (a) An operator T is closable iff any sequence {z,} C Dr 
such that 2, +0 and Tz, — y fulfils y=0. 


(b) A vector x belongs to D(T) iff T is closable and there is a sequence 
{tn} C Dr such that rz, + x and {Tz,} is convergent; if this is true, we 
have Tz, —~ Tx. 


We have mentioned that a closed operator may not be continuous. However, 
this can happen only if the operator is not defined on the whole space 1+. 


1.6.7 Theorem (closed-graph theorem): A closed linear operator T: X — Y with 
Dr = X is continuous. 


Proof: I(T) is by assumption a closed subspace in VY @ J, so it is a Banach space 
with the direct-sum norm, ||[z,y]|lo = |/zilx+|lylly. The map S,: 5,([z,T2x]) =a 
is a continuous linear bijection from I(T) to 4, and therefore S[' is continuous 
due to Corollary 4. The map S2: S2((x,Tzx]) = Tx is again continuous, and the 
same is true for the composite map S20.S;'=T. § 


1.7 Spectra of closed linear operators 


We denote by C(*) the set of all closed linear operators from a Banach space ¥ 
to itself; since such operators are not necessarily bounded, one has to pay attention 
to their domains. A complex number 4 is called an eigenvalue of an operator 
T € C(X) if T—AI is not injective, i.e., if there is a nonzero vector x € Dr 
such that Tx = Ax. Any vector with this property is called an eigenvector of T 
(corresponding to the eigenvalue \). The subspace Ker(7'— A) is the respective 
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eigenspace of T and its dimension is the multiplicity of the eigenvalue A ; in partic- 
ular, the latter is simple if dim Ker(T—A) = 1. Here and in the following we use 
T—X asa shorthand for T—AI where J is the unit operator on #. 

A subspace L C # is called T-invariant if Tx € L holds for all x € LN Dr ; 
we see that any eigenspace of T is T-invariant. Furthermore, Proposition 1.6.5 
gives the following simple result. 


1.7.1 Proposition: Any eigenspace of an operator T € C(*’) is closed. 


Let us now ask under which conditions the equation (T—A)x = y can be solved for a 
given ye X,A EC. Recall first the situation when ¥ := V is a finite-dimensional 
vector space. We know that the equation with y = 0 has then a nontrivial solution 
if A belongs to the spectrum of T which is defined as the set of all eigenvalues, 


o(T) := {AEC : thereisanonzerorEV, Tr=Azxr}. 


Since  € o(T) holds iff det(T—A) = 0, to find the spectrum is a purely algebraic 
problem; o(T) is a nonempty set having at most dimV elements. On the other 
hand, the equation has a unique solution if 4 ¢ o(T) and this solution depends 
continuously on y. In that case therefore T—2 is a bijection of V to itself, and 
we get an alternative way to describe the spectrum, 


o(T) = C€\ p(T), p(T) := {A€C: (T-A)* € BV) }. 


In an infinite-dimensional Banach space the two expressions are no longer equivalent: 
the inverse (T—)~! exists if is not an eigenvalue of T. but it may be neither 
bounded nor defined on the whole 4. The definition can be then formulated with 
the help of the second expression. Taking into account that, for T € C(#) such 
that T—A is invertible, the conditions (T—A)~'! € B(¥) and Ran(T—A) = XxX 
are equivalent by the closed~graph theorem and Problem 60b, we can define the 
spectrum of T € C(%) as the set of all complex numbers \ for which T—A is not 
a bijection of Dr onto 4. This may happen in two (mutually exclusive) cases: 


(i) T—A is not injective, i.e., X is an eigenvalue of T, 
(ii) T—A is injective but Ran (T—A) # 2X. 


The set of all eigenvalues of T forms its point spectrum o,(T). The remaining part 
of the spectrum which corresponds to case (ii) is divided as follows: the continuous 
spectrum o-(T) consists of those A such that Ran(T—.) is dense in 4 , while the 
points where Ran(T—A) # 4 form the residual spectrum o,(T) of T. In this way, 
the spectrum decomposes into three disjoint sets, 


o(T) = 0,(T) Uo-(T) U o,(T). (1.6) 


The set 
A(T) = C\o(T) = {AEC : (T-d)“ € B(X)} 
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is called the resolvent set of the operator T' and its elements are regular values. The 
map Rr: p(T) > B(#) defined by Rr(A) := (T—A)~! is called the resolvent of 
the operator T ; the same name is also often used for its values, 7.e., the operators 
Rr(A). The starting point for derivation of basic properties of the spectrum is the 
following result, which is an operator analogy of the geometric-series sum. 


1.7.2 Lemma: Let B € B(¥) and ||IJ—B|| <1; then B isinvertible, B-! € B(4%), 
and r fe 
B™ = lim )o(I-B)) =: $3 (I-B). 
n—0o $20 re 
Proof: Denote S, := D$_9(I-B)’. The inequality (1.4) gives ||S,—Smll < 
Vyem+i ||1— Bll? for any positive integers m and n > m, so {Sp} is Cauchy, 


and since B(4) is complete, it converges to some S € B(4’). We have BS, = 
SnB = 1+S,—Spr41 3 passing to the limit we get BS = SB=I1,1.e, S=B.9J 


1.7.3 Theorem: The resolvent set of an operator T € C(%) is open, containing to- 
gether with any 2p also its neighborhood U(Ao) := {A € €: |A-Ao| < ||Rr(Ao) 7}, 
and the resolvent is given by 


eae So ee aie (ane 
j=0 


for each A € U(Xo). 

Proof: The operator B) := I — Rr(Xo)(A—Ao) obeys the assumptions of the lemma 
if \ € U(Xo), then By! exists and belongs to B(4’). Using the obvious identities 
(T—A)Rr(Ao) = By and Rr(rAo)(T —A)x = Byx for all x € Dr, we can check 
that (T—)7! exists and (T—A)~! = Rr(Ao) Bx € B(X) ; hence X € p(T) 
and Rr(Ao)Bx* = Rr(A). To finish the proof, we have to substitute for By! the 
expansion from the above lemma. § 


1.7.4 Corollary: o(T) is a closed set. 


To formulate the next assertion, we need one more notion. A vector-valued 
function f : & — V, where V is a locally convex space, is said to be analytic in a 
region GC @ if the derivative 


1) = lim 1) fo) 
10) te in ae 
exists for any Ap) € G. One can extend to such functions some standard results of 
the theory of analytic functions such as the Liouville theorem; if V is, in addition, 
a Banach space the generalized Cauchy theorem and Cauchy integral formula are 
also valid — see the notes for more information. 


1.7.5 Theorem: Let B be a bounded operator on a Banach space 4 ; then the 
resolvent Rg : p(B) — B(#) is an analytic function, the spectrum o(B) is a 
nonempty compact set, and r(B) := sup{|A| : A € o(T) } = limnse || B"||/”. 
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1.7.6 Examples: (a) Consider the operators T and T® := 7} D©) of Problem 59. 
The function fy, : f(z) = e** belongs to Dr and Tf, = Af,, so o(T) = 
op(T) = © and p(T) =@. On the other hand, for a given A € C we define 
the operator S, : (Syg)(x) = ff e*-%g(t) dt which maps C{[0,1] to D®. 
We have (59)’ — A(S)g) = g for any g € C[0,1] and S)(f’—Af) = f for all 
f € D© , and therefore S, = (T—))7! ; in view of the closed-graph theorem 
and Problem 60b, S) is bounded, so we get p(T) = € and o(T) =9. 


(b) The same conclusions are valid for the operator P: Pf :=-if' on L(0,1) 
whose domain D(P) := AC[0, 1] consists of all functions that are absolutely 
continuous on the interval [0,1] with the derivatives in L?(0,1), and its re- 
striction PO := P){f € AC{0,1]: f(0) =0}. However, one has to use a 
different method to check that P and P©) are closed; we postpone the proof 
to Example 4.2.5. 


Notes to Chapter 1 


Section 1.1 A linearly independent set BC V such that Brn = V is called a Hamel 
basis of the space V and its cardinality is the algebraic dimension of V. Such a basis 
exists in any vector space (Problem 4). We are more interested, however, in other bases 
which allow us to express elements of a space as “infinite linear combinations” of the basis 
vectors. This requires a topology; we shall return to that problem in the next chapter. 

A set C CV is convex if together with any two points it contains the line segment 
connecting them. Any subspace L C V is convex, and the intersection of any family 
of convex sets is again convex. If a point x € C’ does not belong to the line segment 
connecting some y,z €C different from 2, it is called extremal. Equivalently, x € C is 
an extremal point of C if x=ty+(1—t)z with t € (0,1), y,z€C implies sc=y=z. 

The Hahn—Banach theorem is proven in most functional-analysis textbooks — see, 
e.g., [KF], [RS 1], [Tay]. The set of all linear functionals on a given V becomes a 
vector space if we equip it with the operations defined by (af + g)(x) := af(x) + g(a) ; 
we call it the algebraic dual space of V and denote as V‘. If V is finite-dimensional, 
one can check easily that V and V/ are isomorphic. 

We often need to know whether a given family F of maps F.: X — Y is “large 
enough” to contain for any pair of different elements z,y € X a map f such that 
f(z) 4 f(y) ; if this is true we say that F separates points. If X,Y are vector spaces and 
the maps in ¥ are linear, the family separates points if for any non-zero x € X there is 
fc € F such that fr(x) #0. This is true, in particular, for F = X/ (Problem 7). 


Section 1.2 As a by-product of the argument of Example 1 we have obtained the fol- 
lowing result: if a sequence {fn} C L?(M,dy) fulfils ||f.—fllp + 0, then there is a 
subsequence {fn,} such that fn, (x) > f(x) for u-a.a. x € M. In fact, this is true for 
any positive measure and the space L?(M, dy) is still complete in this case — see, e.g., 
[ Jar 2], Thm.68; [DS], Sec.III.6. 

An alternative way to introduce the notion of a topology is through axiomatization 
of properties of the closure — see Problem 28 and [Kel] for more details. The described 
construction of the topological product extends easily to any finite number of topological 
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spaces; for further generalizations see, e.g., [Nai 1], Sec.I.2.12. A discussion of the count- 
ability and separability axioms and their relations can be found in topology monographs 
such as [ Al] or [Kel], and also in most functional—analysis textbooks. For instance, any 
second countable regular space is normal; the fundamental result of Uryson claims that 
a second countable space is normal iff its topology is induced by a metric, thus giving a 
partial answer to the metrizability problem. 

A topological space (X,7) is connected if it cannot be written as a union of two non— 
empty disjoint open sets; this is equivalent to the requirement that X and @ are the only 
two sets which are simultaneously closed and open. A set M Cc X is connected if the 
space (M,r) with the induced topology is connected. A continuous map y: [0,1] + X 
is called a curve connecting the points y(0) and y(1). A topological space (or its subset) 
whose any two points can be connected by a curve is said to be arcwise (or linearly) 
connected. Such spaces are connected but the converse is not true. 


Section 1.9 There is a standard procedure called one-point compactification which allows 
us to construct for a given noncompact (X,7) a compact space (X',r’) such that (X,r) 
is its subspace and X’\ X = {xo} is a one-point set; the topology 1’ consists of all sets 
{xo}U(X\F), where F is a closed compact set in X — for more details see, e.g., [Tay], 
Sec.2.31. A simple example is the compactification of ( by adding to it the point oo. 

Any compact space is by definition o-compact but the converse is not true — cf. [KF], 
Sec.IV.6.4. An infinite set in a o0—-compact space has again at least one accumulation point; 
this is clear from the proof of Proposition 1. In some cases the notions of compactness 
and o-compactness coincide (cf. Corollary 5 and Problem 37c). 

A net {yg}sey is called a subnet of a net {ta}ae, if there isa map y: J I such 
that (i) ye = %,a) for all G € J, (ii) for any a’ € J there isa 6’ € J such that 3 > @’ 
implies y(G) > a’. Using this definition, we can state the Bolzano- Weierstrass theorem: 
A topological space is compact iff any net in X has a convergent subnet. A proof can be 
found, e.g., in [RS 1], Sec.IV.3. 

A map f ofa metric space (X,@) to (X’,Q') is uniformly continuous if to any € > 0 
there is 6 > 0 such that o’(f(21), f(a2)) < € holds for any pair of points 21,22 € X 
fulfilling o(%1, 22) < €. One can easily prove the following useful result: a continuous map 
of a compact space (X,@) to (X’,g') is uniformly continuous. 


Section 1.4 A converse to Theorem 1 was proved by F. Riesz: any locally compact 
topological vector space is finite-dimensional — see, e.g., | Tay ], Sec.3.3. Some metric— 
space notions do not extend directly to topological spaces but can be used after a suitable 
generalization. For instance, a set M in a topological vector space is bounded if to any 
neighborhood U of zero there is an a € @ such that M C aU . In view of Problem 42b, 
there is a positive b such that M Cc (U holds for all |G] >. It is easy to see that in 
a normed space this definition is equivalent to the requirement of existence of a cy > 0 
such that ||x|| << cy for all ze M. 

A locally convex space is often defined as a topological vector space in which any 
neighborhood of the point 0 contains a convex neighborhood of zero. We have seen that 
this is true for (V,7”). On the other hand, if a topological vector space (V,7T) has the 
stated property, there is a family of seminorms on V which separate points and generate 
the topology 7 — see, e.g., [Tay], Sec.3.8 — so the two definitions are really equivalent. 

A net {Za}aer in a topological vector space V is Cauchy if for any neighborhood U 
of 0 there isa y€J such that a.—xg €U holds for all a> y, B> 7. In particular, in 
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a locally convex space with a topology generated by a family P the condition reads: for 
any € > 0 and pe, there isa y €J such that p(ta—ag) <e foralla>y,B>y7. 
The space V is complete if any Cauchy net in it converges to some x € V. In a similar 
way, we define a Cauchy net in a metric space. Since such a net in a complete metric space 
is convergent — see, e.g., [DS 1], Lemma I.7.5 — a locally convex space with a topology 
generated by a countable family P is Fréchet iff it is complete in the sense of the above 
definition. The completeness of SJR") is proved, e.g., in {RS 1], Sec.V.3. 


Section 1.5 Combining the results of Example 3c with the inclusion Cg°UIR”) C CUR”)N 
LPUR”, du) we find that the set CUR") L?UR" dy) is also dense in L?(UR", du). This 
can be proved directly for a wider class of ZL? spaces — cf. [KF], Sec. VII.1.2. The proof 
of density of C§°UR™) in C..(JR") can be found, e.g., in | Yo], Sec.1.1. In a similar way, 
the set C§°(Q) for an open connected set 2 c JR”, which we shall mention below, is 
dense in L?7(Q) — ef. [RS 4], Sec.XIII.14. 

The map F: L'UR") — Cx(JR”) is injective as can checked easily: the relation 
Fg =0 for g € L' implies fren f(x)g(x) dx =0 for all f € SUR”); then f,9(x) dz =0 
holds by Problem 47 for any bounded interval J Cc JR” and therefore also for any Borel 
set in JR” ; this in turn means g = 0. On the other hand, F is not surjective — see, e.g., 
[KF ], Sec.VIII.4.7; [Jar 2], Sec.XIII.11. 

The product of functions f,g € SUR") belongs to SUR"). The relation from Prob- 
lem 52 implies that its Fourier transform is fg = (2m)~"/? f *, where 


(f+ 9)(0) = f fe-a)oly) av =f o(e—w so) dy. 


The map [f,g] ++ f *g is called the convolution. It is a binary operation on S(JR”) 
which is obviously bilinear and commutative; the relation f xg = (2m)"/?fg shows that it 
is also associative. Some important extensions of the convolution are discussed in [ Yo], 
Sec. V1.3; [RS 2], Sec.IX.1. 

While (é')* is isomorphic to £° , the dual (2°)* is not isomorphic to £!. This follows 
from Corollary 8c because £! is separable but £° is nonseparable due to Problem 18. 
The situation with the spaces L?(X,dy) is similar: (L?)*, p> 1, is linearly isomorphic 
to L% by fo: fold) = fx pwdu — see [Ru 1], Sec.6.16; [Tay], Sec.7.4. The spaces 
(L®)* and L! are again nonisomorphic with the exception of the trivial case when the 
measure p has a finite support; the expression of (L%°)* is given in [DS 1], Sec.IV.8. 
These results have implications for the reflexivity of the considered spaces as mentioned 
in the text. The proof of Theorem 10 can be found in [DS 1], Sec.II.3.23. 

Another example of Banach spaces in which one can find a general form of a bounded 
linear functional is represented by the spaces C(K) and Cr(K) of continuous functions 
(complex or real-valued, respectively) on a compact Hausdorff space K. The Riesz- 
Markov theorem associates the functionals with Borel measures on K . In the particular 
case of 4 = Cp[a,b] any such measure corresponds to a function F’ of a bounded vari- 
ation; the theorem then claims that for any f € 4* there is F of a bounded variation 
such that f(y) = f, ay 9 4F holds for all y € X, and moreover, ||/|| is equal to the total 
variation of F — i details see, e.g., | KF ], Sec. V1.6.6 or | RN], Sec.50. The general 
formulation and proof of the Riesz—~Markov theorem can be found in [ DS 1], Sec.IV.6. 

One of the most important examples of duals to topological vector spaces is the space 
S(R")’ whose elements are called tempered distributions, and the dual of the complete 
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locally convex space D(2) = (C§°(Q),7) for a given open connected set 2 C IR”. The 
space D(2) is the so-called inductive limit of the locally convex spaces (C§°(Kn), Tn) ; 
where {K,,} is a nondecreasing sequence of compact sets such that U, Kn = and the 
topologies tT, are determined by families of seminorms analogous to that of Example 1.4.5. 
The elements of D(R”)' are called distributions on 9 ; we have, in particular, S(R”)’ C 
DUR"). There is an extensive literature on the theory of distributions; in addition to 
special monographs such [Schw 1] or [GS] the reader may consult, e.g., [ RS 1-2], 
Chaps.V and IX; [ Yo], Sec.I.8 and Chap.VI; [KF], Sec.XIV.4, etc. 

The proof of Proposition 11 for a general locally convex space can be found in [DS 1], 
Sec.V.2. A set M in a Banach space * is weakly compact iff its weak closure is a 
compact set in the topological space (V,T~) — cf. [DS 1], Thm.V.6.1. Ifaset F CV’ 
on a topological vector space (V,7) is separating points, then {py := |f(-)| : f € F} 
determines a locally convex topology which is called the F-topology on V ; it is the 
weakest topology in which all the functionals f € F are continuous. It is clear that the 
smaller the set F’ is, the weaker is the corresponding F-topology. 


Section 1.6 In addition to the norm ||: ||g on 4 ®Y one sometimes introduces || - |f 
by |i [1,22] ff <= (llara||? + llara|I2)/? ; we can easily check that the two norms induce the 
same topology. It is sometimes useful to regard the graph of a given T as a subset in 
the Banach space (4 @ Y, |] - |}). Since the properties of the graph discussed here are 
characterized by topological means only, they are not sensitive to such a modification. 


Section 1.7 The notions of eigenvalue, eigenvector etc. make sense for any linear op- 
erator; however, if T is not closed it can have an eigenspace which is also not closed 
(Problem 61). On the other hand, the spectrum has been defined for the closed operators 
only because otherwise the notion is trivial: if there isa 4 € (such that (T—A)~! € B(AX), 
then T €C(#) in view of Problem 60b, so o(T) =@ holds if T is not closed. 

The theory of analytic vector-valued functions is discussed, e.g., in [DS 1], Sec.III.4; 
{ HP], Secs.3.10-3.15; [Nai 1], Secs.3.12 and 4.7; [Ru 2], Chap.3; [ Schw 2], Chap. VII. 
The case when the values of such a function are bounded operators on a Hilbert space 
is of particular importance; we can then introduce functions which are analytic weakly 
(strongly, in the operator norm) with respect to different topologies which we shall define 
in Section 3.1; however, the adjective may be dropped because the three notions coincide 
— see [HP], Sec.3.10; [RS 1]. Thm.VI.4. The resolvent expansion (1.7) is sometimes 
called the Neumann series after C. Neumann, who introduced it in the theory of integral 
equations. An alternative proof of Theorem 5 which avoids use of the Cauchy formula is 
given in [ BR 1], Prop.2.2.2. 


Problems 


1. Let p>1 and q:=p/(p—1), then the Hélder inequality, 


A 


3 


1/p 1/q 
> lémil < (= i) (x ms 
i} j 


is valid for any n-tuples {€;}, {nj} € ©” or sequences {€;} € & and {nj} € &. 
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ds 


10. 


Analogously, if f?,g? € L'(M,du), then fg € L1(M,du) and 


ie Ifgldu < (J, \fPan) é , atau) 


Hint: We have p~'a? + q~-1b*% > ab for any a,b>0. 


Let p>1. The Minkowski inequality, 


1/p 1/p 1/p 
(xiesar)” < (ser) "+ (s)” 
J 3 j 


is valid for any n-tuples {£;},{n;} € (” or sequences from £, as well as the 
analogous integral relation for functions f,g € L?(M, dp). 


Algebraic isomorphism preserves linear independence. An n-dimensional complex 
or real vector space is isomorphic to @” and JR”, respectively. 


Using the Zorn lemma, prove that a Hamel basis exists in any vector space V, and 
moreover, that for any linearly independent set M c V there is a Hamel basis 
containing M. 


Let Vi, V2 be subspaces of a vector space V. Prove 
(a) dimVj+dim V2 = dim(V; + V2) +dim(V NV2), where Vi + V2 := (Vi UV2)uin 
is the algebraic sum of the subspaces Vi, V2, 
(b) (Vi @ V2)/V, is algebraically isomorphic to V2, 
(c) dimV/V; = dimV —dimV, provided dimV; < oo. 


Let qr be the quadratic form generated by a sesquilinear form F'. Prove 


(a) F is symmetric iff gr(z) € IR forall re R, 


(b) if F is positive, pr : pr(x) = V/qr(z) is a seminorm and the Schwarz 
inequality is valid; it turns into equality iff there are a, GEC, |a|+|6| 40, 
such that gr(ax + By) =0. 


The algebraic dual V/ is separating points in a vector space V. 


Let 1 <p <q; then ||zIloo < |lzllq < Ilzllp < n!/lIa\|.0 holds for any x € ©". The 
first two inequalities are also valid in £?, where || - {|g makes sense for any q > p. 
In this case, however, there are no positive k,k’ such that ||X|lp < k||X|loo and 


|Xllo < k’\|X||q with p <q holds for all X ef. 


Let 1<p<q and p(M) < oo; then ||-||, isa norm on L?(M,dy) and the inequa- 
lity w(M)—/4|\f\lq < w(M)-/?||f|lp holds for any f € L?(M,dy). Furthermore, 
L®(M, du) C LP(M, du) and ||f lp < u(M)'?||flloo for any f € L°(M, dy). 


Suppose that a sequence {fp}?2, C L~°(M,dy) is uniformly bounded, i.e., there 
is a positive c such that ||fnllo <c for all n, and that f,(x) — f(x) p-ae. in 
M ;then f € L©(M,dp) and ||fllo <c. 
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A norm on a vector space is induced by an inner product iff it fulfils the parallelogram 
identity. 

Hint: Using the parallelogram identity show that r(x, y) := ((le+yll?— |la—yll?) /4 
is additive. This implies further that it is homogeneous for rational a, and hence 
for all a € IR by continuity. 


Among the norms of Examples 1.1.4, only {j- ||2 is induced by an inner product. 


Prove Theorem 1.1.5. Using it, show that any finite-dimensional pre—Hilbert space 
contains an orthonormal basis. 

Hint: For N = {y,ye,...} consider the vectors 2p := 2n/|lZn||, where zn := 
Yn — Wha (Lk Yn) ee - 

To any finite subset {z1,...,2n} C V its Gram determinant I'(21,...,%n) corre- 


sponds to the matrix whose entries are the inner products (2;,%%), j,k =1,...,n. 
It is non-negative and equals zero iff the vectors 7%),...,2%n are linearly dependent. 


A metric g on a vector space V is induced by a norm ?ff it is translation-invariant, 
o(x+z,y+z) = o(x,y), and homogeneous at dilatations, o(ax, ay) = |alo(z,y) for 
all z,y,zEV, aed. 


Let P = {pn : n= 1,2,...} be a countable family of seminorms on a vector space 
V which is separating points; then g : o(z,y) = 222,27" BS is a metric on 
V_ which is induced by no norm. 


Let (X,9) be a metric space. Prove 
(a) aset M is closed iff X \ M is open. The intersection of an arbitrary family 
of closed sets is a closed set; the same is true for finite unions of closed sets, 


(b) a point x belongs to M iff there is a sequence {2%,} C M which converges 
to ©, 


Ce em M for any M c X. Furthermore, MC N > M CW and 
MUN =MUN, while for intersections we have only MAN CMNN, 


(d) for a set M Cc X the following conditions are equivalent: (i) M is nowhere 
dense, (ii) M has no interior points, (iii) M is dense in no open ball. 


Let M be everywhere dense in a metric space X , and let B be a family of disjoint 
open balls in X ; then card B < card M. In particular, X is nonseparable if there 
exists B which is not countable. Using this result, prove that the spaces £° and 
L®(R, dx) are nonseparable. 


Let f : X — X’ be a mapping between two metric spaces. Prove 


(a) f is continuous at a point r € X iff f(rn) > f(x) holds for any sequence 
{tn} which converges to z, 


(b) f is continuous iff it is continuous at each point x e€ X. 


Consider a pair of normed spaces V; := (V,||-||;), 7 =1,2, with the same V. 
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(a) The norm ||-||2 is said to be dominated by ||- ||; if there is c > 0 such that 
|z\l2 < ellxll, holds for all 2 € V. Prove that this is equivalent to continuity 
of the identical map Vi; — V2, and this in turn is equivalent to the relation 
T2 C T; between the corresponding norm-induced topologies, 


(b) the two norms are eguivalent if they are mutually dominated; this is true iff 
the identical map V; — V2 is a homeomorphism. If this is the case, Vi is 
complete iff V2 is complete, and the two norms induce the same topology, 


(c) if V; are Banach spaces and ||- |l2 is dominated by ||-||1, then the two norms 
are mutually equivalent. 


Hint: (c) Use the inverse-mapping theorem. 


Finite-dimensional normed spaces V, V’ are linearly homeomorphic iff they have 
the same dimension, dim V = dimV’. 
Hint: Any two norms on a finite-dimensional space are mutually equivalent. 


Let M be a set in a separable metric space (X,g) ; then (M, g) is separable. 
Hint: If {xj :j =1,2,...} is everywhere dense, for any natural number k we can 
find y;x € M such that 0(x;, jx) < o(2;,M) + . 


Check completeness of the following metric spaces: €” and JR” with the norms of 
Example 1.1.4a, 2° and & with p>1, C(X) and L™(M, dp). 

Hint: If {46” Ne. "1, Dee. } C & is Cauchy, we have fh, |e El? <e 
for any N. As for C(X), a uniformly convergent sequence of continuous functions 
has a continuous limit. 


If a Cauchy sequence {%,}92., contains a convergent subsequence, limp—oo fn, = 2; 
GHen Wa C= 


Let Y bea subset of a complete metric space (X, @). The space (Y, 9) is complete 
iff Y is closed. 


A metric space X is complete iff any sequence of closed balls S, = S.,(%n) such 
that Sn4i C Sn, n = 1,2,..., and limpooén = 0 has a nonempty intersection, 


Mp=1 Sn #0. 


Using the previous result, prove the Baire category theorem: If a metric space X 
is complete and {M, : n = 1,2,...} is a countable family of its nowhere dense 
subsets, then UP, M, #X. 

Hint: Since M, 4 X, there is an open ball U; disjoint with M, and a closed ball 
S; C U,. Furthermore, M2 is nowhere dense, so there is an open ball Uz C Sj 
disjoint with M2 etc.; in this way we construct a sequence of embedded balls whose 
intersection is not contained in any of the sets M,. 


Suppose a map M + [M] on 2% satisfies the Kuratowski arioms: M C [M], 
[MUN] =[M]U[N], [0] = and [[M]] = [M] ; then the set family {G € 2% : 
[X \ G] = X \G} is a topology on X and the corresponding closure M is equal 
to [M] forany MCX. 
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29. Prove Proposition 1.2.4. Show that a topological space (X,7) is Haussdorf iff any 
net {ta} C X has at most one limit. 


30. Prove: (a) if {ta : @ € I} is a family of topologies on X, then (\gerTa is a 

topology which is weaker than each T., 

(b) to any S C 2* there is a topology r(S) such that S C 7(S) and any other 
topology 7 containing S is stronger than 7(S). A basis of 7(S) consists of 
the set X together with the intersections of all finite subsystems of S, 

(c) asystem BC 2* is a basis of the topology 7(B) on X iff (i) for any rE X 
there is a B € B such that z € B and (ii) forany B,C Ee B and re BNC 
there isa Dé B such that re DC BNC, 


(d) suppose that a nonempty system B, C 2* corresponds to any x € X and 
the following conditions are valid: (i) z € B for all Be Br, (ii) for any 
pair C,D € B, there is aset Be B, suchthat BCCND, (iii) if Be B, 
and y € B, then there is a C € B, such that C C B. Then the systems 
B,, x € X, are local bases of the topology 7(B), where B := Uzex Br, and 
B is a basis of this topology. 


81. Prove: (a) if a topological space (X,7) is separable, then (X,7’) with a weaker 
topology, 7’ C 7, is also separable, 


(b) a second countable topological space is separable, 
(c) a metric space is separable iff it is second countable, 


(d) the topological spaces (X,Tfin) and (X, Tcount) of Example 1.2.3 are not first 
countable; the former is separable while the latter is not. 


82. Let 71 C 7™ be two topologies on a given X . Prove 


(a) (M),, C (M),, for any MCX, 


(b) anet {xq} which converges in (X,72) converges in (X,7,) too. 
83. Let (X,7) and (X',r’) be topological spaces. Prove 


(a) ifa map f : X — X”’ is continuous at a point x € X and a sequence 
{tn} C X converges to x in (X,7), then {f(an)} converges to f(x) in 
(X',7’). If (X,7) is first countable, also the converse is true (compare to 
Problem 19a and Proposition 1.2.4b), and the closure points of a set M Cc X 
can be characterized by sequences as in Problem 17b, 


(b) let B, Cr, BY, Cr’ be local bases at the points x and 2’ := f(x); then f is 
continuous at x iff for any B’ € Bi, the pull-back f‘-!)(B’) contains some 
BeB,, 


(c) if f is continuous, we have f(Mf) c f(M) for any MCX. 
34. Let (X;,7;), 9 = 1,2, be topological spaces. Prove 


(a) Tx,xX2 is the weakest topology, with respect to which the maps Gy) = 
2j, J =1,2, are continuous, 
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(b) in the topological product, we have M; x Mz = M, x Mg for any M; C X;. 


35. A topological space (X,7) fulfils the axiom 73 iff to any neighborhood U(x) of 
a point x € X there is a neighborhood V(x) such that V(x) C U(z). 


36. A family C C 2* is called centered if any finite subsystem of it has a nonempty 
intersection. A topological space (X,7) is compact (o-compact) iff any (countable) 
centered family of closed sets in it has a nonempty intersection. 


37. Let X be a topological space. Prove 
(a) if X is a T;-space, x € X is an accumulation point of aset MCX iff its 
arbitrary neighborhood contains infinitely many points of the set M, 


(b) if X is a T\-space and any infinite set M CX has an accumulation point, 
then X is o-compact, 


(c) a second countable space is compact iff it is o-compact. 


Hint: (b) Given a centered family {F, : n = 1,2,...}, construct the sets Ry := 
an, F,,, pick a point xy from any Ry and show that we can select an infinite set 
{xn, : k = 1,2,...} whose accumulation point belongs to all Ry. (c) Any open 
covering of X has a countable subcovering. 


88. If X is a connected topological space and a map f : X — X’ is continuous, then 
f(X) is a connected set. 


39. A vector space V equipped with a topology 7 such that the summation and scalar 
multiplication are continuous satisfies the axiom T3 ; this means that (tv3) may be 
replaced by the T)~separability requirement. 

Hint: For any U(0) there is U’(0) such that x,y € U'(0) implies x—y € U(0). 
From here we get U’(0) C U(0), which allows us to use Problem 35. 


40. Let V bea vector space and By C 2* a family with the following properties: 


(i) 0€ B forall BE By, 
(ii) for any pair C,D € Bo there is a B € Bo such that BCCND, 
(iii) if B € By and y € B, then there is C € Bp such that y+CcB. 
Then the family B, := 2+Bpo for any x € V is a local basis of the weakest topology 


containing B := Urey B, and B is a basis of this topology. Under additional 
conditions (see Theorem 1.4.2), (V,7(B)) is a topological vector space. 


41. Let V bea topological vector space. The closure of a subspace in V is a subspace, 
and the intersection of any family of closed subspaces is a closed subspace. 
Hint: Use Problems 33c and 34. 


42. Let (V,T) be a topological vector space; then 


(a) if U(0) is a neighborhood of zero, then for any x € V there is an az > 0 such 
that ax € U(0) holds for 0 < Jal < az, 
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44. 


45. 


46. 


47. 


48. 
49. 


50. 
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(b) aset M CV is called balanced if M = Ujaj<1 2M . Prove that any neighbor- 
hood U(0) contains a balanced neighborhood V(0) of zero. 


Hint: Use continuity of scalar multiplication. 


Let 9 be the metric of Problem 16, then the corresponding topology coincides with 
the locally convex topology generated by the family. A sequence {rn} C V is 
Cauchy in the metric space (V, 9) iff to any n and e€ > 0 there is a j(€,n) such 
that pr(ap—22) < € holds for all k,£> j(e,n). 


Prove the relation (1.1). 

Hint: Any f € SUR") can be approximated with respect to all seminorms || - ||j,« 
by the functions f. : fe(x) = f(x)h(ex), where h is a function of C™°UR™) such 
that h(x) = 1 for |2| <1. 


Prove Theorem 1.5.1. 
Hint: A Cauchy sequence {x,} contains {x,,} such that ||rzx,,,—Zk,|| < 27”. 


A complete envelope (V, 0) of a normed space V is a Banach space which is unique 
up to linear isometries preserving V . In particular, if V is a pre-Hilbert space we 
can introduce on V the inner product (-,-)« such that 9(z,y) = /(r—y,2—y)s- 


Prove: (a) If y is a o-finite measure on a o-algebra A C 2*, then the set 
{xm : M € A, p(M) < oo} is total in L?(X, dy), 
(b) the set S( of Example 1.5.3c is total in L?(IR", dy), 


(c) if J is a bounded interval in JR”, then there is a sequence {f,} C C§°(IR") of 
functions with supports in J, |fx(x)| <1, which converges to yy pointwise 
in the interior of J, as well as in the || - ||, norm, 


(d) let f € L?(J,du), where J is a bounded interval in IR" and y a Borel 
measure; then there is a sequence {f,} C C§°(UR") such that supp f, C J 
and || fx —fllp 7 0. 


Hint: (a) If u(X) < oo, one has L? c L! and to any f € L! there is a o-simple 
function ge such that |f(x)—ge(x)| <¢ ae. in Af, (b) for any M € B™ and 
€>0 there is R ER” such that w(RAM)!/? <e for allae X. 


The space (Co.(X), ||- |loo) is complete and Co(X) is dense in it. 


Let *# be a Banach space. If Z is a closed subspace in 4’, prove that ||#||z := 
infzez ||z—z{| is a norm on the factor space ¥ := ¥/Z and the corresponding 
normed space is complete. 
Hint: Use Theorem 1.5.1. 


Let B:V, — V2 be a linear operator; then the following conditions are equivalent: 
(i) Be B(V,,V2), 
(ii) B is continuous, 


(iii) B is continuous at a point x € V,. 
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The operator norm can be expressed equivalently as 


Ba , 
i cap Bxl, = int Cp, 
eV \{0} [lz] ze B, (0) 


where B,(0) is the closed unit ball in the normed space V; and Cg is the set of 
all c such that ||Ballo < cllz||, for all ee V,. 


Show that the Fourier transformation maps S(JR”) onto itself, and check that 


I. e'*%a(y) f(y) dy = uE ou) F(a+y) dy 


holds for any f,g € SCR") and x € IR”. 
Hint: To estimate y7(D* f)(y), use an integration by parts together with the fact 
that 2+ (1+/2|?")-! belongs to L!(JR"). 


The Fourier—Plancherel operator is surjective, Ran F = L?(JR") , and its inverse 
F-' ig the continuous extension of F)!. 


Let V be a normed space. Prove that 


(a) the dual spaces V* and Vf coincide iff dimV <0, 


(b) if dimV =o, there are unbounded linear functionals on V . 


Hint: (b) Choose a countable subset {e;} in a suitable Hamel basis of V and define 
{[:V—-@ by f(e;):= 7 and f(z) =0 outside {e;}1in. 


Let V beanormed space. If the functionals J; on V* aredefined by Jz(f) := f(x), 
then the map r++ J; is a linear isometry of V to a subspace in V**. 


Let (V,7) bea finite-dimensional topological vector space, then the weak topology 
Tw coincides with rT. 


Given a Banach space %, operators B,C € B(#), and sequences {Bn}, {Cr} C 
B(#), prove 


(a) if Bax + Bx and C,x— Cz, then B,Cyr — BCz, 


(b) if Bay — By for all y from a total set in 4 and the sequence {||Bnl|} is 
bounded, then B,2 — Bz for all re. 


Prove the inverse-mapping theorem. 


Define the operator T from ¥ = C'/(0, 1] to itself with the domain Dr:={feEer: 
f' € X#} by Tf := f’. Show that T is unbounded and closed, and that the same 
is true for its restriction T to DO :={f € Dr: f(0) =0}. 


Let ¥, ¥, Z be Banach spaces. Prove 


(a) a linear operator T : % — ¥ is closable if there is a closed S: ¥ — ) such 
that I(T) C T(S) ; in that case TCS, 
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(b) if 7: X% — ¥Y is closed and invertible, then T~! is a closed linear operator from 
y to X. The operator T is unbounded if there is a sequence {yn} C RanT 
such that T’-!y, + 0 and {yn} does not converge to zero; on the other hand, 
if T is unbounded, then there is a sequence of unit vectors yn € RanT which 
has no accumulation points and fulfils T~'y, — 0, 

(c) let S: % >) be a closed linear operator; if T : Y — Z is linear, invertible, 
and T~! € B(Z,y) ; then TS: ¥ — Z is closed. 


Define T : L?(0,1) + L7(0,1) with the domain Dr :=C/{0,1] by (Tf)(x) = f(0)z. 
Prove that T is neither closed nor closable, and find an eigenspace of T’ which is 
not closed. 


Let T € C(4), then the first resolvent identity (or Hilbert identity), 
Rr(d) — Rr(u) = (A-#) Rr(A)Rr(u) ; 
holds for any 4, € p(T) and the operators Rr(A), Rr(u) commute mutually. 


Let T,S €C(#) with Dr C Dg and »€ p(T) p(S) ; then the second resolvent 
identity, 

Rr(\) = Rs(A) — Rs(A)(T—-S)Rr(), 
is valid. Moreover, let T—S =UV, where Dy = Ds, Du = VDr, RanV =2, 
and V is invertible, then the operator J+ VRgs(A)U is also invertible and 


Rr(d) = Rs(A) — Rs(A)U(I+-VRs(A)U)“'V Rs(A). 


Given an operator T € C(a’), we define its regularity domain r(T) as the set of 
all A € @ for which there is c(A) > 0 such that ||(T—A)a|| > c(A)|\x|| holds for all 
zx € Dr. Prove that 

(a) 2(T) is an open set, 

(b) Ran(T—A) is a closed subspace in 4 for each \ € x(T), 

(c) p(T) C x(T). 


The operator Q on C[0,1] defined by (Qf)(x) := af(xr) is bounded, ||Q|| = 1, 
and we have o,(Q) = o-(Q) =@ while o,(Q) = [0, 1). 


Given a bounded operator B on a Banach space 2, we can define e? € B(%) as 
the norm limit e? := limy—soo DA_, 4 B®. Prove that 


(a) le? || < ell, 
(b) BC =CB implies eB + = eBeO = e eB | 


Bz 


(c) the map z++ e”* is continuous with respect to the operator norm. 


Chapter 2 


Hilbert spaces 


2.1 The geometry of Hilbert spaces 


In Section 1.1 we introduced pre—Hilbert spaces and derived some of their simple 
properties, in particular, the Schwarz inequality, parallelogram identity, and polari- 
zation formula. A pre-Hilbert space which is complete with respect to the metric 
induced by the inner product is called a Hilbert space. Any such space is at the 
same time a Banach space, so it has all of the properties discussed above. 

If two Hilbert spaces are linearly isometric they have the same algebraic and 
metric properties; we usually say briefly that they are isomorphic. By polarization, 
an isomorphism of Hilbert spaces preserves the inner product. Furthermore, the 
standard procedure of Theorem 1.2.2 allows us to complete any pre—Hilbert spa- 
ce, uniquely up to an isomorphism, to a Hilbert space (cf. Problem 1.46). On 
the other hand, not every notion is adapated from Banach-space theory without a 
modification. For instance, given Hilbert spaces H;, 7 = 1,2, we can equip their 
vector-space direct sum with the inner product defined by 


({r1, 22], (yr, Yo) = (@1, Yada + (Xe, Yo)o 5 


it is easy to check that the pre-Hilbert space obtained in this way is complete, 2.e., 
a Hilbert space which is called the direct sum of the Hilbert spaces Hy, Ha and 
is denoted as H,; @ H2. It does not coincide with the Banach-space direct sum 
of H, and H, though, of course, they are topologically equivalent (see the notes 
to Sec.1.6). The construction extends to any finite number of Hilbert spaces; for 
infinite families the direct sum will be introduced in Section 2.3 below. 

Using the Schwarz inequality, we can check easily that the inner product in 
is jointly continuous, 7.e., it is continuous as a map H@H — C'. Combining this 
with the properties of the orthogonal complement mentioned in Sec.1.1 we get 


Mt = (yin = (Min) 


In particular, the orthogonal complement to any subset of a Hilbert space is a 
closed subspace. The existence of an inner product together with the completeness 


Al 
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requirement allow us to derive for any Hilbert space several results which have a 
simple geometric meaning. 


2.1.1 Proposition (orthogonal-projection lemma): Let G be a closed subspace in 
H ; then for any xz € H there is just one vector yz € G such that the distance 
o(z,G) = ||z—yell. 

Proof: By definition, o(z,G) := d := infyeg ||z—y|| , so there is a sequence {yn} C G 
such that limpoo || —yn|| = d. The parallelogram identity implies that {yn} is 
Cauchy, ||¥n—Ym||? = 2\|]z—gyall? + 2l2—Ymll? — 4| 2 — Y+Y%m)/2|? < 2l|2—ynl|? + 
2\|z—ym||? — 4d? + 0 for n,m — oo. Since G is closed, {yn} converges to some 
Ye €G and limp ||x— Yyn|| = ||2—yz||. The uniqueness is checked again by the 
parallelogram identity. 


The vector y, is called the orthogonal projection of x onto the subspace G. 


2.1.2 Theorem (orthogonal-decomposition theorem): Let G be a closed subspace 
in  ; then to each vector x € H there is just one y € G and z € G* such that 
r= ytz. 

Proof: The uniqueness follows from G G+ = {0} (see Problem 2). To prove the 
existence, we choose for y the orthogonal projection of x onto G, so it is sufficient 
to check that z := x—y € G+, or equivalently, (e,z) =0 for any unit vector eEG. 
We have |{z|| = infyeg ||z—y’|| by the orthogonal—projection lemma; hence choosing 
y =y+(e,z)e we get |lz||? < lz — (e,z)ell? = |lz\|? — |(e,z)|?, which is possible 
only if (e,z)=0. &§ 


The theorem thus ensures existence of the orthogonal decomposition H = G@G+ for 
any closed subspace G C H. Combining this with the properties of the orthogonal 
complement (Problem 2) we get the following useful criterion. 


2.1.3 Proposition: A set M C H is total iff M+ = {0}. 


We saw in the previous chapter that the problem of finding the dual of a given 
Banach space has no universal solution and may be difficult in some cases. In Hilbert 
spaces the situation is much simpler. 


2.1.4 Theorem (Riesz lemma): To any f € H* there is just one vector ys € H 
such that f(x) = (ys,z) holds for all c € H. The map f + y; is an antilinear 
isometry of the spaces H* and H. 


Proof: Since f is linear and continuous, Kerf is a closed subspace in H. If 
Ker f = H we have f =0 and y; =0. In the opposite case there is a unit vector 
y € (Ker f)+ ; then z, := f(x)y — f(y)a belongs to Ker f for any x € H. Hence 
0 = (y, 22) = f(x) — (y,z)f(y) and we can choose ys = f(y)y. The uniqueness of 
ys and the injectivity of the map f ++ yy follow from the fact that H+ = {0} ; 
the map is at the same time surjective because (y,-) is a bounded linear functional 
for any y€ H. The relation f(x) = (ys,x) implies |[f|| < |lys|| by the Schwarz 
inequality; on the other hand, setting r=yy we get |lyy||? = f(ys) < II fll llyrl|, so 
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together we have ||f|| = |ly;||. 8 


In view of Theorem 1.5.12a, a sequence {zn} in a Hilbert space H converges 
weakly to a vector x iff (y,t%n) — (y,x) for all y € H. This yields the following 
useful result. 


2.1.5 Proposition: If a sequence {x,} C H converges weakly to x € H and 
\|z,|| — ||x|], then it also converges to x in the norm topology. 


We have mentioned the role of bases in a finite-dimensional vector space. Using 
the inner product, we can extend this notion to infinite-dimensional Hilbert spaces. 
Let € := {e, : a € I} bean orthonormal set in H, not necessarily total. The 
numbers &, := (€e,2) for a given x € H are called Fourier coefficients of the 
vector x with respect to €. Let {a; : j = 1,2,...} be an at most countable 
subset of the index set I ; then the Bessel inequality, 


X |(€as:2)) < lel? 


is valid (Problem 1). Using it, we can easily derive the following properties of 
orthonormal! sets. 


2.1.6 Proposition: (a) The set X of nonzero Fourier coefficients of a vector x 
with respect to a given € is at most countable. 


(b) Any orthonormal set {e;} fulfils w-limj..e; = 0. 


Proof: The sets Xn := {f&2: a € 1, |&| > n71}, n = 1,2,..., are finite, so 
X = Ux, X, is at most countable. The series 3}; |(e;,r)|? converges, which 
implies that lim;...(e;,z) =0 forany reH. FE 


Since the sequence {€,} of nonzero Fourier coefficients fulfils 732, |€a,|? < ||all?, 
the sequence of the vectors Yn := 71 €a;€a; is Cauchy. Its limit, which we denote 
as y, belongs to the closed subspace &;, , and we can easily check that (e,,z—-y) = 0 
holds for any a € I, i.e., zy € E+. This means that y is the orthogonal projection 
of x onto &,, and therefore 


22 2 
xl? = ly? + la—yll? = > |e] + Ileal? 


j=l 


In particular, if € is a total orthonormal set in 7H, we have 
ca — i, =" eeeaeeand lz) = >. |E,,| 
j=l j=l 


The first of these relations shows that the sum of the series on the right side is 
independent of the order of the terms. This is true even in the case when € is not 
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total: if {€,} is another sequence formed of the set of nonzero Fourier coefficients of 
the vector x, then z := limno KZ €4,ea, is again the orthogonal projection of 
x onto Em, and therefore z = y by Proposition 1. In this way, total orthonormal 
sets allow us to characterize any vector x € H by means of its Fourier coefficients. 
The set {€a,: j = 1,2,...} in the above formulas depends in general on x but 
this fact is not important. If H is separable then any orthonormal set in it is at 
most countable (see Proposition 2.2.1 below) and we get 


oo oo 
z=) (e;,z)e, llall? => \(e;,2) |; 
j=1 j=l 


the first relation is called the Fourier expansion of the vector z, the other is the 
Parseval identity. In a nonseparable H the sets of Fourier coefficients are not 
countable; then it is sufficient to replace the sequences by the corresponding nets, 


t= lim 2 (Ca, 2)€q = Da (eapeven, 


ack acs 
{||| = lim 3 Kee)? = sup yy \Kewmac) |? = > (canna)? . 
aék KES gek aces 


where S is the family of all finite subsets in J ordered by inclusion; the relations 
are now checked as in Proposition 3.2.13 below. 

Total orthonormal sets may serve therefore as the sought generalization of the 
(orthonormal) bases in finite-dimensional vector spaces; any such set will be called 
an orthonormal basis. In view of Proposition 3, an orthonormal set € is an 
orthonormal basis iff €+ = {0}, or equivalently, if it is maximal, i.e., it is not a 
proper subset of another orthonormal set. 


2.1.7 Theorem: Any Hilbert space 7 has an orthonormal basis. Any two bases 
in a given 7 have the same cardinality. 


Proof: The family S of all orthonormal sets in # is partially ordered by set in- 
clusion; then Zorn’s lemma implies existence of its maximal element €. If there 
were a nonzero x € €+, then € would be a proper subset of the orthonormal set 
EU{zx/|\x||} , but this contradicts its maximality. The other assertion can be proven 
using Proposition 6 (see the notes). 


Hence we may introduce the Hilbertian dimension of H as the cardinality of any 
orthonormal basis of it; note that it need not coincide with the algebraic dimension 
of # unless both of them are finite — cf. Problem 4. Unless stated otherwise, the 
term dimension in the following always means the Hilbertian dimension. 


2.1.8 Theorem: Hilbert spaces H, H’ are isomorphic iff dim H = dimH’. 

Proof: If dim = dim H’ there are orthonormal bases €, €' of the same cardinality, 
so we can construct a bijection Vo: € — €’. Its linear extension is a bijection from 
Ein to €j;, which preserves the inner product. Hence it is bounded, and since 
Ein = H, it has a unique continuous extension V € B(H,H’) ; it is easy to check 
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that it is isometric and is therefore the sought isomorphism. On the other hand, if 
there exists an isomorphism V : H — H’ and € is an orthonomal basis in H then 
VE is an orthonormal set of the same cardinality as € , which is moreover total in 
H’ because (VE)+ = VEL={0}. 


2.2 Examples 


Since Hilbert spaces represent the raw material for most of the problems discussed 
in this book, it is appropriate to list now those which will appear frequently. Since 
most of them are separable, the following result is useful. 


2.2.1 Proposition: Any orthonormal set in a separable H is at most countable. 
A Hilbert space 1 is separable iff it has at most a countable orthonormal basis. 


Proof: If € := {e,: a € I} is an orthonormal set then {U.(e,) : a € I} with 
é € (0,271/?) is a disjoint set of open balls in H of the same cardinality as € , so the 
first assertion follows from Problem 1.18. Together with Theorem 2.1.7, this implies 
in turn the existence of an at most countable orthonormal basis in a separable H, 
while the opposite implication follows from Lemma 1.5.2. 


This means, for instance, that the space @ represents a canonical realization of 
an infinite-dimensional separable Hilbert space. It would be a mistake, however, to 
conclude that it is sufficient to know one Hilbert space of each dimension only. The 
problem is that we are usually interested in an operator or a family of operators on 
a given #H whose properties may be derived easily in one realization while being 
disguised completely in another one. With a certain overstatement, we can say that 
if Dirac had thought of operators only as infinite-dimensional matrices he would 
never been able to formulate the equation of motion of a relativistic electron. 

In the following examples, we shall construct countable orthonormal bases in 
some frequently used Hilbert spaces, thereby proving their separability. 


2.2.2 Example (the space L?(a,b), b-a < oo ): In view of the obvious isomorphism 
we may consider, e.g., the space L*(0,27). The set Ep := {e,: k =0,+1,+2,...} 
with e%(x) := (2)~'/*e** is orthonormal; we shall show that it is total. Let 
f € €¢. The function g: g(x) = Jo f(s) ds + C is absolutely continuous in (0, 27] 
and an integration by parts yields 0 = (ex, f) = (27)~!/?[g(27) —g(0)] — ik(ex, 9) 
for k = 0,+1,+2,.... This implies, in particular, g(27) = g(0) = C,, and choosing 
C so that 7" g(x) dx = 0, we get (ex,9) = 0 for k = 0,+1,+2,.... By Fejér’s 
theorem, g can be approximated uniformly by trigonometric polynomials: for any 
€ > 0 there is T. = Df_, Oeee such that ||g—T.l|. <e€. Since (T-,g) =0, we 
get 
llgll? = (g—Te,9)| < llg—Tell llgll < ellgll, 

so g = 0; in view of its continuity we have g(x) = 0 for all x € [0,27], and 
therefore f(x) =0 ae. in [0,27], which is what we set out to prove. The set &p 
(or a sequence arranged of it) is called the trigonometric basis in L?(0,27). 
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In a similar way, one checks that the set {z': 1 =0,1,...} is total in L?(a,6) ; 
since it is linearly independent it can be orthogonalized by Theorem 1.1.5. This 
yields, in particular, the orthonormal basis Ep := { (/ + 4)1/2 Ree: boeeOphyes. } im 
L?(—1,1), where 


aie 
Pp: B(x) = (2'1) ‘a =i) 
are Legendre polynomials. 


2.2.3 Example (the spaces L?(JR) and L?(IR*) ): Let us start with L°UR). We 
again want to contruct orthonormal bases by ulereeameentio of a suitable total 
linearly independent set; this time we use J‘) := { f{* : k = 0,1,...}, where 
(A) (q) r= |g|*+!/2e-27/2¢k | We assume a > —1, so J) is contained in L?(iR); 
it j (a)\* Gk (4 (2) 
we shall show that it is total. If g € (T ) we have )’k-0 “Fi (9. fk ) = 0Mfor 
any real \ and a natural number n, and therefore 


[eo o(@)4(@)de=0, AER, 


by the dominated—convergence theorem, or F(g fe”) = 0. Since F¥ is injective (see 
the notes to Sec.1.5) and f(x ) > 0 for all z € R, we conclude that g(x) = 0 
a.e. in IR. Gram-Schmidt procedure applied to T‘ then yields the orthonormal 
basis €{° := { 9) : n=0,1,...} with 


(a) ki ne 
Aa) = (Res) beter 102) 


k+a+l 
(a) k! ae 1/2_,—x?/2 (a+1) 
— ee Or —2x a 2 
fate) = (ate) lait terra’, 

where L : L(r) = ep od (enwauie) are Laguerre polynomials. In the case 
a = —1/2 we conventionally use the functions hy := (—1)"/4lg(-1/2) where [-] 
denotes the integer part, or 

Ia) = (2° nl)" “M4 e277 Hae) (2.1) 


which express in terms of Hermite polynomials Hy: Hp(x) = (—1)"e™ oe lial (see 
also Problem 6). As for the space L?(JR+), one can use the results derived above 
to check that FO) = {¢) :n=0,1,...} with 


1/2 
we)(n) = (—™ _\ "0/2 5-2/2 pa) 

ae (5 mar) a 
and a > —1 is an orthonormal basis (Problem 7). 


Given an orthonormal basis in L?(JR) we can construct orthonormal bases in the 
spaces L?(JR"), n = 2,3,..., and at the same time check their separability, using 
the following more general serait 
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2.2.4 Proposition: Let pu, v be o-finite non-negative measures on sets M, N, 
respectively, such that the spaces L?(M,du) and L?(N,dv) are separable (see the 
mites romeo Ss) ="{eprmkr=—n1)2)... foand) F™==4{ fF, : bt, 2;:..} be or- 
thonorma!] bases in these spaces. Then L?(M x N,d(w@v)) is separable and the 
set {gm : k,l =1,2,...} with gw: gu(x,y) = ex(x)fi(y) is an orthonormal basis 
in it. 

Proof: By the Fubini theorem, {gi} is an orthonormal set in L?(Mx N,d(u@v)) =: 
L2,, so it remains for us to check that it is total. Let (h, gm) = 0 for h € L2, and 
k,l=1,2,.... The y-cut of the function h, h¥: h¥(x) = h(x, y), then belongs to 
L?(M,du) for v-aa. ye N, and H: H(y) = ||h®||? belongs to L'(N,dv). The 
assumption can be thus written as 


[, Haw) avy) =0, 1 =1,2,..., 


where Hy(y) := (€x,h¥). Due to the o-finiteness of v, the functions H;, are 
measurable, and the Schwarz inequality yields |Hx(y)|? < ||h¥||? = H(y) for v-a.a. 
y€N,so H, € L*(N,dv). Since € and F are orthonormal bases, the relations 
(Ax, fi) = 0 imply Ay(y) = 0 for v-a.a. y € N, and this in turn means h¥ = 0 
for v-a.a. y € N. Hence we get ||h||2, = fy ||AY||?dv(y) =0,2e,h=0. 


A frequently used generalization of the spaces L?(X, dy) is obtained if we replace 
complex functions by functions with values in some auxiliary Hilbert space. 


2.2.5 Example (vector-valued functions): Let G be a separable Hilbert space and 
ji a non-negative o-finite measure defined on a o-algebra AC 2*. It is easy to 
check that the functions f : X — G with the pointwise defined summation and 
scalar multiplication form a vector space; we denote it as V(X,G) or simply V. A 
function F’ is measurable if the complex-valued functions (g, F'(-))g are measurable 
for all g € G. The set of all measurable vector-valued functions in Y is a subspace 
VY, C VY ; using the Parseval identity for some orthogonal basis in G we find that 
|F()|l8 is measurable for any F € V,. Consider next the set 


£4X,dusG) =| FEV, + f MPO) Au(2) < oS, 


which is a subspace in V, as can be seen from the inequality ||F(x)+G(z)||% < 
2(||F (x) 12 + ||G(x) |g), and its subspace Lo := { F € V,: F(z)=0 p—ae.in X }. 
The factor space 

L?(X, du; G) = £L°(X,du;G)/Lo 


thus consists of the equivalence classes of functions that differ mutually on a p—zero 
set; with the usual license we shall refer to them again as vector-valued functions. 
L?(X,du;G) is obviously a pre-Hilbert space with respect to the inner product 


(FG) = [| (Fle), G(e))o dua). 
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It is moreover complete as we shall see in Example 2.3.4 below; we call it a Hilbert 
space of vector-valued functions. In the case G = C this space, of course, coincides 
with L?(X, dp). 


In the rest of this section we are going to discuss another important Hilbert space. 
Consider the measure pig : fg(M) = = fy edz on © where dz = d(Re z) d(Im z) 
refers to the Lebesgue measure. By "A?(C) we denote the pre-Hilbert space of all 
functions which are analytic in ( and square-integrable with respect to pw, , with 
the inner product defined as in L?(C, dug), i.e., 


(f.9) = = [ Fe@gl(2)e Fas. 


In distinction to L?, the elements of A?(€) are continuous functions; hence f = g 
means f(z) = g(z) for all ze C and (f,f) =0 implies f =0. 

We want to show that A?(C) is complete with respect to the metric induced 
by the inner product. To this end, we consider the subspace P Cc A?(€) of all 
polynomials on C and the Hilbert space P obtained by the standard completion of 
P . We can check directly that the functions Un: Un(z) := 2"(n!)"/?, n=0,1,. 
form an orthonormal set. Since this is a Hamel basis in P, and therefore Cte) in 
P, the space P is separable. 


2.2.6 Lemma: Let P € P ; then the relations P(w) = (ew, P) and |P(w)| < 
||P || el?/2 hold for w € C, where e,,(z) := €® 

Proof: We can compute directly that |lew|| = e!'’/?, so the second relation fol- 
lows ow the first; it is sufficient to check the latter for P = u,. The inequality 
pa 0 H (wz)* | S ela valid for any N and z € @ allows us to use the dominated- 
convergence theorem, which yields 


[, wnlz)eul2) dyug(2) = Jim So i “7h Un(z)te(2) dilg(2) = un(w). 


Now we are able to prove the basic properties of the space A?(C). 


2.2.7 Theorem: (a) A?(C) is a separable Hilbert space. 
(b) Let f € A?(C) and we C; then 


Fw) = (ews f) = — f Heyeal@ePP ae. 


(c) The norm convergence in A*(€), ||f.—f|| + 0, implies the pointwise conver- 
gence, f,(z) > f(z) forall ze C. 


(d) A*(C) contains just those analytic functions f = Se cee for which 
Dexo |cx|? < 00. 
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Proof: To prove (a) we have to check that A?(C) =P. Let {P,} C P be a Cauchy 
sequence; then the corresponding sequence of jz,—-equivalent classes is Cauchy in the 
Hilbert space L?(€',du), so there is a class [F] € L? such that ||F—P,|| > 0 
and a subsequence {P,,} converges to F pointwise uy—almost everywhere (cf. 
Example 1.2.1). The above lemma then implies that limp. P,(w) = f(w) exists 
for all w € € and the convergence is uniform in any compact K C €’. Hence f 
is holomorphic (see the notes), f € [F] and f € A?, i.e, P Cc A®(€). To prove 
the opposite inclusion, we ee note that P+ = {u,: n=0,1,...}+, so a function 
f © A2(C), f(z) = Deo anz", belongs to P+ iff fy Remy aullz) = 0 for all 
n. Since fu, € C(C,du) and the series converges uniformly on compact sets, we 
get the relation 


Ok kon = 
0 = (um f) = Jim So ef Ae dul) = anv 
for any n, i.e., A?(C)N P+ = {0}. Combining this with the inclusion P c A?(C), 
we arrive at the sought result. 
The reproducing property of part (b) is valid for each polynomial. For any 
f € A?(€) there is a sequence {P,} C P such that ||f—P,|| > 0. Repeating 
the above argument we find that P,(w) — f(w), so (b) follows from inner—product 
continuity; using the Schwarz inequality, we get assertion (c). Finally, we have 
shown that {un : nm = 0,1,...} is an orthonormal basis in A?(€), so any f € 
| ac) = be written in the form of the Fourier expansion, f = )0?2o Cnun, with 
If ae o ) lcn|?. On the other hand, if a function f is holomorphic and fulfils 
©» In|? < oo then the non-negative polynomials Qn(z) := | Deo ceur(z)? | 
fulfil Q,(z) — |f(z)|? for all z€ C and 


J Qnl2) dpig(2) = y lel? < » lex? < 00, 


so f € L?(€,du,) by Fatou’s lemma. [ff 


The space A?(€) is called Hilbert space of analytic functions; the same name 
is used for the spaces of those functions which are analytic and square integrable in 
some region G C C (see the notes to this section and to Sec.8.1). The space A?(C) 
is naturally embedded into L?(C, dug), 1.e., it is linearly isometric to its subspace. 
Since it is a proper subspace it has some special properties, e.g., that expressed 
by assertion (c). The most important among them is the reproducing property of 
assertion (b). It implies, in particular, that the set C := {e, : w EC} is total 
in A?(C’) because C+ = {0}. However, C is not orthonormal and it cannot be 
orthogonalized as a uncountable set in a separable Hilbert space. The elements of 
C , which will later be called coherent states, have the following remarkable property: 
substituting f(z) = (ez, f) to the formula of Theorem 7b and using e,(z) = e,(w) , 
we get the relation 


fw) = = [(enf) e,(w) e#P/2dz, (2.2) 
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which can be regarded as a continuous analogy of the Fourier expansion. 


2.2.8 Remark: Coherent states represent a more general concept; we return to this 
topic in Section 8.1. In some Hilbert spaces, however, families of coherent spaces 
are obtained as an isomorphic image of the set C. As an example, consider the 
operator V : L?(JR) — A?(C) obtained by linear and continuous extension of the 
map Vo: Vohy= tn, 2 =0,1,..., where the Aa are the vectors (2.1). As in the 
proof of Theorem 2.1.8, we can cyl that V is an isomorphism of the two spaces. 
The vectors Wy = el? /2y-1e,, thus have unit norm for any w € @ and 


dy (x) = ae el@?-lwl?—(2—-v20)?1/2 . (2.3) 


they are again called coherent states. 


2.3. Direct sums of Hilbert spaces 


In Section 2.1 we introduced the notion of direct sum for a finite system of spaces. 
If all of them happen to be mutually orthogonal subspaces of a given Hilbert space, 
then there is a simple way to characterize the direct sum topologically. 


2.3.1 Proposition: Let G1, Ge,...,Gn be closed mutually orthogonal subspaces of 
a given H; then @}., G; is the minimal closed subspace containing all G;. 


Proof: By definition, Hy := Or, G; = 1@ = pe my : 2; CG, }, so it is con- 
tained in any subspace that contains all G;. It remains for us to check the closed- 
ness, i.e., that limp.o2Zn € Hw holds for each Cauchy sequence {xr'} C Hy. 
The decomposition x‘) = ae x; € G;, together with the orthogonality 
of the subspaces, implies ||2 —x2(™ |? = DX, lac =") |? so the sequences 
{x }e2, are also Cauchy, and due to the closedness of the subspaces each of 
them converges to some z; € G; ; the vector x := ion =,2; € Hy then fulfils 


limpoo Iz - all? = Vitae Spo =1 yee” =e," =0. @ 


The space Bei G; is called the orthogonal sum of the subspaces G,,...,Gy . Propo- 
sition 1 allows us to extend this notion to any family {G,: a € 1,Ga # {0}} of 
subspaces in a given H: we define the orthogonal sum 3>®.,; G, as the minimal 
closed subspace containing all G,. 

For a finite system of subspaces, the notions of direct and orthogonal sum are 
thus equivalent, and any element of Bp G; can be expressed as the sum of its 
projections to the subspaces G;. This last property also remains essentially true in 
the general case. Consider the subspace (Uger Ga)jin » Which is called the algebraic 
sum of the subspaces G, and is denoted as )4¢,G_ (compare with Problem 1.5). 
If I is finite it coincides with ®.,;G., while in the general case we have 


Per eee es (2.4) 


aél ael 
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since the subspace on the right side is closed and contains all G. and hence also 
Deer Ga ; at the same time the relations G, C ®.;Ga together with the closedness 
of °®.;Ga yield the opposite inclusion. Combining the relation (2.4) with some 
simple consequences of the orthogonal—decomposition theorem (Problem 13) we get 
the sought expression of the elements of ©®,;G. in terms of the projections to 
the subspaces Ga. We formulate this for a countable J ; the generalization to an 
arbitrary index set is left to the reader (Problem 14). 


2.3.2 Theorem: Let {G; : 7 =1,2,...} bea family of mutually orthogonal closed 
subspaces of a given #1, then 


wi =09; = (ren: = fin San. 6b 
j= 


Proof: Denote the right side of the last relation as G. Obviously G C )°32 wee 
G C @§2,G; in view of the relation (2.4). To check the opposite ntamnsica take a 
vector 2 € Dj=1 G; and denote by x; its projections to G;. Je to Problem 13b, 
$= 212; is the projection of z to @5.1 9;, 80 |lSall? = Xe llayll? < [lz ||?, n = 
1,2,.... Problem 13c together with the already proven Melugion GC O79; 
implies that s, — s € OF, G; ; hence z—s € O72, G;. We have 


(omeyt/)o—= ie (+S 7 = (22,4) = 0 
j=l 


for any y € G,k = 1,2,..., because zx, is the projection of x to G,. Problem 13a 
now gives r—s € (@72,9;)*, so r=s,ie,xrEG. Ht 


2.3.3 Example: Consider the space H := L?(X, dy) , where p is a non-negative o— 
finite measure defined on A C 2*, and the family of the subspaces G; corresponding 
to a decomposition X = U2, X; into a system of disjoint finite-measure subsets, 
G; = {xx,f : f € L?}. The measures yy; : duj = xx,dp are finite and the 
spaces G; are obviously isomorphic with L?(X;,dyu;). This implies their closedness, 
G; =G;, while G; 1 G, follows from X;1 X, =@ for 7 #k. For any f eH 
we construct the sequence of vectors fn := Dju1 Xx f =xxmf, where X™ := 
Us_, X;. It converges pointwise to f and |[fnll < |lfll, so Ilfa—fll? < 4llFIP? ; 
the donaiffiited-convenseare theorem then implies | fn—f|| +0. Hence we get the 
decomposition L*(X,du) = F219; corresponding to X = Use, X;, where the 
spaces G; are isomorphic to L?(X;,d;) . 


In Section 2.1 we have described how one constructs the direct sum H;®H_2 to 
a given pair of Hilbert spaces H,, H2. The subspaces H) := {[a1,0] : x1 € Hy} 
and H) := {[0,22] : x2 € He} in Hi @ He are obviously orthogonal and closed, 
naturally isomorphic to H, and H2, respectively, and their orthogonal sum is 
Hi @ He. The above results thus give us an idea of how to extend the notion of 
direct sum to an arbitrary family of Hilbert spaces. 
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Let M :={Ha: a@€1} be such a family. We do not here require injectivity 
of the map a + H, ; the set M can even consist of copies of the same Hilbert 
space. The quantities referring to Ha will be indexed by a. By H we denote the 
subset in the Cartesian product XaesHa consisting of the maps a++ X(a) € Ha 
which satisfy the following conditions: 


(i) I(X):={a: x(a) #0} is at most countable, 


(ii) Daerxy |X (@)||2 < 00. 


The set becomes a vector space if we set (aX +Y)(a) :=aX(a)+Y(a),a€e€, 
and linear combinations satisfy condition (ii) in view of the Minkowski inequality; 
in the same way as for £? we can check that 


1/2 
Xr |X] = ( » ixcoo] 


€I(X) 


is anorm on H. Each of the norms || - || fulfils the parallelogram identity; hence 
the norm |j- || is induced by the inner product 


Ce a) a Le (X(a),¥(a))a } 


a€l(X)AI(Y) 


to check the correctness of the definition we have to employ the Schwarz and Hdélder 
inequalities to prove that the series is absolutely convergent, and the right side is 
therefore independent of the order of summation. Let us finally show that the 
space (H,|| - ||) is complete. Let {Xn} C H be a Cauchy sequence; the set 
Tix} = Use [(Xn) is at most countable, so we can write it as {a;: 7 =1,2,...}. 
The condition ||Xn—Xmll < € for n,m > n(e) implies that there is X(a;) € Ha, 
to any j such that X,(a;) — X(a;), and at the same time, the inequalities 
Dei IXn(az) — Xm(a;)\|5, < €? for N =1,2,.... Taking first the limit m — 00 
and then N — oo, we get D2, ||Xn(a,;) - X(ai)I2, <«? for n > n(e), so if we 
define X(a) := 0 for all indices a € I \ I;x,}, the vector X belongs to and the 
sequence {X,,} converges to it. 

The Hilbert space constructed in this way is called the direct sum of the spaces 
Ha and is denoted as 2.,;Ha. If J is a finite set, the conditions (i) and (ii) 
are fulfilled automatically and we return to the construction of Section 2.1. In the 
countable case, condition (i) can be dropped and the definition simplifies to the 
following form: j=, H; is the set of all sequences {x,;}52, with x; € H; such 
that 32, ||a,||7 < oo, with the componentwise defined operations of summation 
and oie maligna and the inner product 


(X,Y) = Stenu); ; 
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2.3.4 Example (vector-valued function revisited): We use the notation of Exam- 
ple 2.2.5. Let G be a separable Hilbert space of dimension N ; we shall show that. 
the corresponding Hilbert space L?(X,dy;G) of vector-valued functions is isomor- 
phic to Or, L?(X,d) , thus proving its completeness. We shall consider only the 
case N = oo since most part of the following argument is otherwise trivial. We 
choose a fixed orthonormal basis {e;} C G and define the complex—valued functions 
xt» F;(x) := (e;,F(x))g to any measurable vector-valued function F ; these are 
obviously measurable and fulfil ||F(x)\|3 = 4; |Fj(x)|? < oo for all « € X. Con- 
versely, the last condition together with the measurability of the functions imply 
that r++ F(x) := 0; F;(x)e; defines a measurable vector—valued function. 

_We want to show that F +> V(F) := {F;}52, is the sought isomorphism. 
Let F € L?(X,du;Q) ; then the relation ||F||? = fy 0; |F;(x)|?du(x) gives Fj € 
TA Xan) etoraje—ly 2,0. cand fy@a(x)dufa) < ||F||? for n= 1,2,..., where 
we set ®, := 07_, |F;|?. The monotone-convergence theorem implies the identity 
Fl? = X5 Sx | (@)|?du(x), so RanV C YP L*(X, du) and the map V is norm 
preserving. It remains to check its surjectivity. We have 


as du < } II? 
[Pode = of IFiPde < MH 


for any sequence {Fj} € Dy L*(X,du), so using the monotone-convergence the- 
orem once again, we get lim,..®,(x) = ¥,|F;(z)|? < 00 for paa «re X. 
It follows that the map x ++ F(x) := Xj Fj(x)e; is a measurable vector-valued 
function and 


2 a ; aT # Ay2 
[F@MRdu@) = f jim Pndu = Jim f bau = IFHIB, 


i.e., F € L?(X,dy;G) and {F;} =V(F). 


The spaces of vector-valued functions can be further generalized if we replace 
the auxiliary space G by a family of Hilbert spaces (x) labelled by a variable x. 
Suppose we have a measure space (X,A,) with a non-negative o-finite measure, 
a nonempty family R of separable Hilbert spaces anda map x++ H(z) from X to 
R. We define the “dimension function” by d: d(x) = dim H(z) and assume that it 
is measurable, i.e., Xn := d°-"(n) € A for n=1,2,...; then Xo := X \ US Xn 
also belongs to A. The elements of the set V := XzexH(x) are again called vector- 
valued functions and Y becomes a vector space when equipped with the pointwise 
defined algebraic operations. 

There are several ways to define measurability for these vector-valued functions 
(see the notes); we choose the simplest one. A set {E, : k = 1,2,...} C V is 
called a measurability basis if {F,(x)}?2, is an orthonormal basis in H(z) for any 


ge X and &,(x) = 0 holds if t € X, and k = n+1,n+2,.... To a given 
measurability basis we define measurable vector-valued functions as those F € V 
such that z + (E(x), F(x))2 is measurable for all k = 1,2,... ; they form a 


subspace V,,({E,}) in V. In the particular case when H(x) = G for all x € X the 
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measurability basis is formed by the constant vector-valued functions Ex(-) = ek, 
where {e,} is an orthonormal basis in G, and we arrive back at the definition of 
measurability from Problem 1la. Furthermore, V,({Ex}) is in fact independent of 
the measurability basis used (Problem 17), so we denote it as V, in the following. 

The map r+» H(z) together with the subspace V, C V is called a measurable 
Hilbert-space field on (X, A, 4). If two such fields have the same dimension function 
d they are essentially identical (Problem 17). Now we are able to introduce the 
notion of the direct integral of a measurable field (r+ H(z), V,) on (X,A,p) in 
analogy with Example 2.2.5: we set 


vA ” Fikec\ ala) em 


where C? := {FeEYV,: fy ||F(x)||2du(x) < co} is factorized with respect to the 
subspace Lp := {Fe V,: F(x) =0 p—ae.} ; the inner product at the factor 
space is introduced by 


(F,G) = f (F@),G@))edule). 


We leave to the reader to check the completeness and other properties of the direct 
integral (Problem 18). Note further that the direct sum considered above turns out 
to be a particular case of the direct integral referring to a discrete measure. 


2.3.5 Example: Let J := {z;: 7 =1,2,...} be a countable subset in X . Let py 
be a measure on X supported by J and such that u;(M) = card(MO/) for any 
M Cc X. All the vector-valued functions are obviously u;-measurable, V,, = V, 
for any map x++ H(z). Since y;(X \ J) = 0, each F € fP H(z) du;(zx) is fully 
determined by its values at the points z;,7 = 1,2,..., and moreover, we have 
FI? = D321 IF (as) [3 ; it follows that [2 H(x) dur(c) = @%, H(z). 


2.4 ‘Tensor products 


Let H,, He be arbitrary Hilbert spaces; then a pair (H,@®), where H is a Hilbert 
space and @: H, x H2 > H isa bilinear map, is called a realization of the tensor 
product of H; and Hz, if the following conditions are valid: 


(tl) (x@y,z'@y’') = (z,2’)i (y, y’)2 holds for all z,2' € H, and y,y/ € He, 
(t2) the set {x @y: XE Hi, y € He} is total in H. 
After formulating such a definition we must first check the question of existence. 


2.4.1 Theorem: For any pair H,, H2 there is at least one realization of their 
tensor product. 

Proof: Consider the vector space H; © H2, whose elements are linear combinations 
of the “objects” xoy with x €H, and y € H2 where the equality roy=a2'oy/ 
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means x = 2’ and y = y’, and any set of mutually different objects is linearly 
independent, so the set {roy : x € Hy, y € He} is a Hamel basis of H,oH2. To 
be concrete, we may take for H, © He the set of all functions f : Hy x Hz - C 
with f‘-)(€\ {0}) finite; the object zy is realized by the function e,,, such that 
€z,y(u,v) = 1 for [u,v] = [x,y] and er,(u,v) = 0 otherwise. Now let LD CH, oH 
be the subspace spanned by the elements of the form 


L =) aj ut; > Yk — (= on ° (Sau) ; 
ik j k 


For a given {x,y] € Hy x He we denote by roy the set of all f € Hi o He such 
that f=zey (mod L) ; this defines the map @: Hy x He > (HioH2)/L, which 
is obviously bilinear. 

Next we shall define on (H, © H2)/Z an inner product which would fulfil the 
axiom (tl). We associate the number s(roy,z’oy’) := (x, 2’); (y, y’)2 with any pair 
roy, z’oy ; since {xo y} is a Hamel basis the map defined in this way extends 
uniquely to a sesquilinear form s on Hi°H2 (Problem 20). Moreover, s(f,g) =0 
if at least one of f,g belongs to L ; from here it follows that s(fu,g.) for given 
u,v € (H,oH2)/L does not depend on the choice of the elements f,, € u and gy € vu. 
Hence the relation (u,v) := s(fu, gy) defines a form on (H,;oH2)/L which is easily 
seen to be sesquilinear. We shall show that it is strictly positive. We take an arbi- 
trary u:= 07,2; @y, and choose orthonormal bases {e1,...,€n,}, {f1,--+s fas} 
- in the subspaces {21,...,2n}un C Hi and {y1,...,Yn}iun C He, respectively. 
The expansions 2; = >, Cjre, and ye = Yisdkefs give U = VysQrsey @ f, with 
Ars = Yj Cjrdjs. Then (u,u) = Y,,|@rs|? > 0, where the left side is zero only if 
u = 0. By the standard completion of the inner product space (Hi o H2)/L, we 
finally arrive at a Hilbert space H which satisfies the definition requirements (t1) 
and (t2). &f 


The next question concerns relations between different tensor—product realiza- 
tions. They appear to be quite simple. 


2.4.2 Proposition: Let (H,@) and (H,®’) be realizations of the tensor product 
of Hi and Hz ; then there is a unique isomorphism V : H — H’ such that 
x ®' y=V(x@y) holds for all [x,y] € Hi x He. 

Proof: By Problem 20, x ®y=Z®@y implies x @'y =Z%®'y for any [r,y], [Z,%] € 
Hy x He ; hence the relation z ®’ y = V(x @ y) defines a bijection from {zr @y : 
[x,y] € Hy x He} to {x@’y : [x,y] € H, x H2} which preserves the inner product. 
The sought isomorphism is then obtained by its linear and continuous extension; the 
argument is the same as in Theorem 2.1.8. 


These results demonstrate that the tensor—product realizations for a given pair 
H, and Hp exist and are unique up to an isomorphism. This is the starting point 
for an abstract definition of the tensor product (see the notes). For our purposes, 
however, it is not needed: the tensor product of Hilbert spaces H1, He will simply 
be understood as any of its realizations; in concrete situations there is usually some 
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standard map ® which we shall specify when introducing the particular tensor 
product. For simplicity we shall often write H = 7H, ® H2 remembering that the 
formula is just a shorthand expression for the fact that the tensor product of 7 
and Hp, is realized in H through a map ®. 


2.4.3 Remark: One defines the algebraic tensor product of arbitrary subspaces 
Ly CH, and Ly C Hg by L1xLlog:={x@y: re Ll, y © Le hin. We can easily 
check the relation L,xL, = L, @ Ly (Problem 23); in particular G, @ Go = G:xGe 
holds for closed subspaces G; C H;. 


2.4.4 Proposition: (a) If M, is total in H,,r = 1,2, then the set Mj. := 
{z@y:xr2EM,,y € M2} is total in H,; @ He. 


(b) If {e) : a, € I,} is an orthonormal basis in H,, r = 1,2, then the vectors 
el) @ a>, a, € I,, form an orthonormal basis in H,; ® H2. In particular, 
H ® He is separable if the same is true for 7H; and He. 


Proof: Denote L, := (M,)tin ; in view of the above remark it is enough to prove that 
zr®y € L,xLe for all x € H, and y € He. Since the M, are total by assumption, 
it follows from the estimate that 


||z@y—Ln @Ynl| < (ern) @Yn|| + [lz @ (Yn—y)|| = [22a lyn lle + [ella lyn—ylle - 
The remaining statement is an easy consequence of part (a). § 


2.4.5 Example: Under the assumptions of Proposition 2.2.4 we define a map ®: 
L?(M, dy) x L?(N,dv) + L?(M x N,d(u@v)) by (f @9)(ay) = flag(y). It 
is obviously bilinear, (f ® 9, f ® 9) = (f,f)u(g,9). by the Fubini theorem, and 
{f@g: f € L?(M,du), g € L?(N,dv)} is total in view of Proposition 2.2.4. Hence 
the requirements (t1), (t2) are valid and we may write 


L?(M x N,d(u®v)) = L?(M,dp) @ L?(N, dv). (2.5) 


In a similar way, we can check that the Hilbert space L*(X,dy;G) of Example 2.2.5 
can be expressed in the tensor—product form 


L?(X,dp;G) = L?(X, du) @G ; (2.6) 
the corresponding bilinear map is (f ® ¢)(x) := f(x)@ (cf. Problem 11b). 


Notes to Chapter 2 


Section 2.1 The concept of an abstract (separable) Hilbert space belongs to J. von 
Neumann, who formulated it at the end of the twenties [ vN 1,2]; it was extended a few 
years later to nonseparable spaces by H. Léwig and F. Rellich. The name was chosen 
in honor of D. Hilbert, who investigated the ¢? and L? spaces at the beginning of the 
century. 
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The orthogonal—projection lemma admits a generalization: it is clear from the proof 
that G may be replaced by any closed convex set in H. The existence of the orthogonal 
decomposition H = G@G+ for any closed subspace G C H is a characteristic feature of 
Hilbert spaces, distinguishing them among Banach spaces — cf. [DS 1], Sec.VI.12. Proof 
of the second part of Theorem 7 is somewhat lengthy — see, e.g., [DS 1], Sec.IV.4. 

The dual #* is a Hilbert space with the inner product (-,:), defined by (f,g)« = 
(Yg, Yr) ; using this result, one can check that any Hilbert space is reflexive. It is also easy 
to see that the kernel of any functional f € H* has codimension one, since Ker f = {ys}+. 
To check that and H* are linearly isometric, we can combine the Riesz lemma with 
the isometric involution J defined for a given orthonormal basis {e, : a € I} by 
Jz = Veer (2) €a)ea: 


Section 2.2 For Fejér’s theorem see, e.g., [ Jar 2], Thm.191. Separability of the spaces 
L?(M,du) depends on the properties of the measure p. Denote the corresponding o-al- 
gebra as A; then a system N C A is called a basis of the measure ys if inf{ u(MAN): 
N € A} =0 holds for each M € A with p(M) < oo. Assume that p is o-finite; then 
a space L?(M, dy) is separable iff yz has a countable basis (Problem 8) and the same is 
true for the L? spaces, p>1. 

The functions from the closed linear envelope of the set {e, : e,(2) =e”, XE IR} C 
CUR) are called almost periodic; they form a subspace in the Banach space C(JR) with 
the norm ||- |j~., which we denote as AP. H. Bohr has proven the following criterion: 
a function f : R — C is almost periodic iff it is continuous and to any € > 0 one 
can find / > 0 such that any interval of the length / contains at least one t such that 
|f(x+t) — f(x)| <e holds for all 2 € RM. In addition to ||- |loo, the space AP can be 
equipped with another norm, 


ae 
(f)? = jim == fl Uile)Pae 


(Problem 9), which is obviously induced by an inner product. The completion of AP with 
respect to p(-) is thus a Hilbert space, which is not separable because {e,: A € R} is 
an orthonormal set in it, which is not countable. The basic properties of almost periodic 
functions, are discussed in [DS 1], Sec.IV.7; [RN], Secs.101, 102. 

In the proof of Theorem 7 we have used the fact that the limit of a sequence of 
holomorphic functions which converges uniformly on compact sets is holomorphic — cf. 
{Ru 1], Thm.10.28. If GC @ is an open connected set we can in the same way as above 
construct the Hilbert space A?(G) of functions, which are analytic and square integrable 
in G. However, the measure used to define the inner product is not necessarily jug ; if 
G is bounded, for instance, we can replace it by the Lebesgue measure. In particular, if 
G is the unit disc D, := {z €@ : |z| <1} then the functions e, : en(z) = \/ 4 2", 
n=0,1,..., form an orthonormal basis in A?(D,) — see [ Hal 2], Secs.24, 25. 


Section 2.8 The orthogonal sum )>®.; Ga is sometimes called also the Hilbertian sum 
of the subspaces. If € = {e;} is an orthonormal set in H, and G; are the corresponding 
one-dimensional subspaces, then Theorem 2 reduces to the Fourier expansion of x with 
respect to €. In this sense the relation of Problem 13c represents a generalization to the 
Parseval identity; in the same way one can generalize the Bessel inequality. 

As in the case of a finite family, the direct sum >°,¢; Ha can be written as the 
orthogonal sum of the subspaces H'®) := { X € Does Ha : X(B) = 0g for B # a}, which 
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are naturally isomorphic to Hg ; this is checked easily using Problem 14. 

Direct integrals of Hilbert spaces were introduced by J. von Neumann. They have 
numerous applications, e.g., in spectral theory of self-adjoint operators ({ BS], Chap.7; 
[ Mau], Chap.IX), diagonalization of commutative W*-algebras ([ Di 1], Sec-II.1), de- 
composition of C*-algebras or representations of locally compact topological groups to 
irreducible components ([ Nai 1], Chap.VIII; [BR 1], Sec.4.4), etc. Some authors like 
[ Nai 1], Sec.41, define the direct integral only for the particular class of maps x ++ H(z) 
described in Problem 18. Our definition of measurability is adopted from { BS], Sec.7.1; 
a more general approach can be found, ¢.g., in [Di 1], Sec.II.1; [BR 1], Sec.4.4.1 


Section 2.4 The abstract definition of the tensor product is formulated in terms of the 
theory of categories — see, e.g., [ KGv], Sec.I.3 and Problem 61. The construction used 
to prove Theorem 1 is adopted essentially from [We], Sec.3.4. There are alternative ways 
to construct the space H: in [RS 1], Sec.I1.4, one finds a construction based on bilinear 
forms on H, x H2; another method uses continuous antilinear operators Tzy : He - Hy 
defined for any x € Hi, y € He by Try := (-, y)2z — see Problem 3.39b. 

All the definitions and results formulated here for a pair of Hilbert spaces extend easily 
to any finite family; the map [x,...,2%n] + 21 ®---@ Zp is then multilinear, fulfils the 
condition (%1®---@rn, 2@:--@2;,) = [Tjai(2j, 25); , and the set { 11@---@¢p : 23 € H;} 
is total in H,; @-:-@H,. The notion of a tensor product also extends to infinite families 
of Hilbert spaces {[vN 3]; however, we shall not need it in the present book. 

We have introduced the algebraic tensor product for subspaces of a Hilbert space. If 
Vi, V2 are arbitrary vector spaces, a realization of their algebraic tensor product is a vector 
space V together with a bilinear map @ : V; x V2 — V fulfilling the following conditions: 


(i) {r@y: 2EVi,yeVohimn=V, 


(ii) for any other pair (V’,®’) which fulfil requirement (i) there is a linear operator 
T:V—V’' such that 2 @!y =T(x@y) holds for all [x,y] € Vj x Vo. 


From here one can again arrive at an abstract definition of the tensor product — see, e.g., 
[ MB], Sec.1X.8. It is possible, however, to identify the algebraic tensor product with one 
of its realizations, in which case we write symbolically V = VjxV2. To fulfil condition 
(ii), it is sufficient that the map [x,y] + 2 @y has properties (a), (b) of Problem 20; this 
ensures consistency of the present definition with that of Remark 3. The existence of an 
algebraic-tensor—product realization can be checked again constructively. We can start, 
e.g., as in the proof of Theorem 1; however, the procedure is now different since there is 
no inner product (Problem 22). 

The notion of a tensor product is also introduced for Banach and locally convex 
spaces. The definition starts from the algebraic tensor product of the appropriate vector 
spaces. Given Banach spaces 4, with the norms ||- ||,, r = 1,2, we call a call a norm 
|: |] on 441 x%Q a cross-norm if |jx @ y|| = |lzllillylle for all z € Ai,y € X%. In 
distinction to the Hilbert-space case, the last condition does not determine the norm on 
* x2 uniquely; there is, however, a maximum cross-norm defined by || 7.2% @ yella := 
inf { dj llzjllillyglle ¥ where the infimum is taken over the set of vectors bey x’, @ Y; € 
Ax fulfilling 52; @y; = Dyce @ye- If || Ila is a cross-norm on 2x, then the 
Banach space obtained by the standard completion is denoted 41 @q X2 ; for 41 @q.X%2 we 
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also use the symbol 4,@42. The cross—-norms on 4X2 and their relations to the cross— 
norms on 4/4} are discussed in [Sch], see also [KGv], Sec.III.1.4; for an extension of 
the construction of %,®2 to locally convex spaces see [Gr]. 


Problems 


1. Prove (a) let G be a finite-dimensional subspace in a Hilbert space H ; then if 
{e1,.-.,@n} is an orthonormal basis in G, the orthogonal projection y, of a 
vector  € H to G is given by yx = D1 (e;,z)e;, 


(b) the Bessel inequality. 


Hint: (a) The distance of x from y € G is a function of n complex parameters; 
find its minimum. 


2. Let M and G bea subset and a closed subspace, respectively, in a Hilbert space 
H. Prove 
(a) Mn M?+ c {0}, in particular, M+ = {0} if M is total, 
(b) G++ =G, and more generally, M++ = Mim, which implies H = M+ @ Min. 


8 Let L,, Lo be subspaces in a Hilbert space # and dim Lz < co. Prove 


(a) if L, is closed, the algebraic sum L, + L2 is a closed subspace, 


(b) if T is a linear operator on H such that L1+L2 C Dr and the subspace TL, 
is closed, then there is a subspace M Cc (TL;)+ such that dimM < dimL, 
and T(L,;+L2)=TL,0M. 


4. There is no Hilbert space with a countably infinite Hamel basis. 
Hint: Use the Gram-Schmidt theorem. 


5. Let # be a separable Hilbert space. 


(a) Prove the existence of a countable orthonormal basis in # without using the 
Zorn lemma. 


(b) If G is adense subspace in #1, there is an orthonormal basis € of H contained 
in G. 
Hint: (a) From a countable dense set M one can pick a linearly independent subset 
N such that M C Min. (b) If {an : n = 1,2,...} is dense in H, there is 
{Ynm : 2,m =1,2,...} CG such that |l¢n—gynmll < m7. 
6. The basis Ey = {hn : n = 0,1,...} in L?(UR) defined by (2.1) is contained in 
S(IR) and its elements are eigenvectors of the Fourier~Plancherel operator, Fhn = 
2 or, we O,.... 


7. The set F, ie a@ > —1, of Example 2.2.3 is an orthonormal basis in L*(JR*). 
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8. Let (X,A,p) be a measure space with a o-finite ys ; then the space L?(X, du) is 
separable iff 4 has a countable basis. 
Hint: If {fn : n = 1,2,...} is dense in L?(X,du), we can find f, such that 
xm —fnll < m7! to any m and M&A with u(M) < oo. Denote Mam := {z: 
[fa(2)—1| <2-™} and show that p(MAMpm) < m7. 


9. Let f be an almost periodic function. 


(a) Suppose that a sequence {fn} C {e,: 4 € Rhin fulfils ||fn—fllo + 0. Define 
F,(T) := 3p |= fn(x)|?dx and F(T) corresponding in the same way to f. 
Check that {F,} converges to F uniformly in JR* and use this result to 
prove the existence of the limit p(f)? = limr_.. F(T) and the relations 


ay 
(f) = lim, pUfn) > pUSn)® = im, = | Mf(e+0) Pde 


forany te R. 
(b) p(f) > 0 unless f is zero. 


Hint: (b) Use the Bohr criterion (see the notes to Sec.2.2). 


10. The map f +—> Fy(f) := f(w) is for any w € € a bounded linear functional on 
A2(€) whose norm is ||Fi|| = el’/2. 


11. Prove: (a) A vector-valued function F : X — G is measurable iff the functions 
(e;, F(-)) are measurable for all elements of some orthonormal basis {e;} of 
the space G. 


(b) If { fa: @€ I} is a total set in L?(X,dz) and {e;} is an orthonormal basis 
in G, then the set { fa(-)ey : aE 7, 7 =1,2,...} is total in L?(X,dy;G). 


12. Check the relation (2.3) for coherent states in L?(JR) and prove that 
[ shw(a)Pde = ViRew, f Fo@) F*(ade = -iFImw. 
R R dx 


Hint: Use part (b) of Theorem 2.7 and the generating function of Hermite polyno- 
ke 
mials, Sk2o Fr He(x) = e-* +222 for zEC and re R. 


13. Let {Ga: a€I} be a family of mutually orthogonal closed subspaces of a given 
H ; then 


(a) (38192) = Neer 92, 


(b) if vq is the projection of a vector r € H to Gy and {OF a...) 8S, 
finite subset in J, then the projection of x to the subspace Dj=1Ga; equals 
Wet La; ’ 

(c) given a countable subset {aj : j = 1,2,...} C I and a sequence {z;} with 
tj E Go; , Set Sn := 1 2;. The sequence {sp} converges iff 32, |lz;\l? < 
oo ; if this condition is valid and x := limpoo $n, then x; is the projection 
of x to Ga, and |lx||? = O72, |lzyll?. 
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Let {Ga: a€I} bea family of mutually orthogonal closed subspaces of a given 
#H, where J is an arbitrary index set. The subspace sou 1Ga CH consists just of 
those vectors for which there is an at most countable {a,: k = 1,2,...} CJ such 
that 2 = limo Dja1%j with 2; € Ga,. 


Let &; be an orthonormal basis in a Hilbert space H;, 7 =1,2,... ; then €& = 
21 {0,... 0,€;,0,...} is an orthonormal basis in @j=1 H;. In particular, 1 is 
separable provided all the spaces 1; are separable. 


Any measurability basis {E,} belongs to V,. A vector-valued function F' belongs 
to V, iff x (F(x),G(x))2 is measurable for all G € V,. 


Let (r++ (2x), V,) and (++ H'(z), Vj.) be measurable Hilbert-space fields such 
that dimH(x) = dimH'(x) holds for all x € X ; then there is a map x + U(z) 
such that 


(a) U(x) is an isomorphism of the spaces H(x) and H’(x) for any x EX, 
(b) a vector-valued function F’ belongs to Vj, iff there is F € V, such that 
F'(x) = U(x)F(z) for all rE X. 


Let (r++ H(x), V,) be a measurable Hilbert-space field with a measurability basis 
{E,: k=1,2,...}, and denote H = f? H(a) du(x). Prove 


(a) if fe(x) := (E(x), F(x))2 for a given F € H, then the sequence {f,} belongs 
to L?(X,du) and 


(F,G) = | D> feo du = >> (fies 9k) 12 | 
X p=1 k=1 


in\particular, F =0 if f, =0 for k=1,2,..., 

(b) if a sequence {fx} C L*(X,du) fulfils 7Z2, ||fellz2 < 00, then there is just 
one F € # such that (Ex(x), F(x))2 = fe(x) holds for y-a.a. rE X, 

(c) the space H is complete, i.e., a Hilbert space, 

(d) if {fa : a € I} is a total set in L*(X, dy), then the vector-valued functions 
re» For(x) := fok.(x) form a total set in 7H, 

(e) if L?(X,dy) is separable, the same is true for H. 


Let X = Uo Xn be the decomposition of a set X corresponding to a given 
dimension function d. Choose a sequence {G,}°2, of separable Hilbert spaces such 
that dimG, =n,n=1,2,..., and dimQp = oo and define Hg(x) := G, for x € 
Xn,n=0,1,.... Using orthonormal bases in the spaces G, construct a measura- 
bility basis in XzexHa(x). Prove that the map V: F +> {F,}%, with F, := 
F} Xp is an isomorphism of the spaces JY H(x) du(x) and @&2.9 L?(Xn, du; Gn) - 


Let x @y € Hi @ He ; then 


(a) x@y=O0 iff at least one of the vectors x,y is zero, 


(b) if « @y is nonzero and equal to z’ @y’, then there is a nonzero a € € such 
that 2’ = ag and y'=a7y. 
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Prove: (a) If dim; =n < oo then H, @ He is isomorphic to Qe) He. 


(b) Let Hy = pe 19a, where the Ga are mutually orthogonal closed subspaces, 
then the subspaces Ga@H2 C Hi @Hz2 are mutually orthogonal and Hi @H2 = 


Srey Go ® Ho . 


Let V,, V2 be arbitrary vector spaces. Show that a bilinear map [z,y] Hp r@y € 


V, © V2/L satisfies implications (a), (b) of Problem 20. 
Hint: Check under which conditions roy € L or roy—z' oy! € L, respectively. 


IX L2 = LT; ® Tz holds for any subspaces L; C H;. 


Let €(*) = {e} and ¥ = {f,} be orthonormal bases in the Hilbert spaces +, 


and H2, respectively; then (Ee Ofp: PH l,... ima, b= yi} & 
an orthonormal basis in #11; ® He. 


Chapter 3 


Bounded operators 


3.1 Basic notions 


The space B(H) of all bounded linear operators on a given Hilbert space H re- 
presents a particular case of B(V,V’) discussed in Section 1.5, so all of the results 
derived there apply. At the same time, the existence of an inner product in H 
brings new information. For instance, we know that B(H) is a Banach space with 
the norm of Problem 1.51; now we are able to express it alternatively as 


[Bl = sup |(x,By)| = sup |(z, By)|, 
z,yES} r,ye By 


where $), 8, are the unit sphere and the unit ball in #H, respectively. Another 
simple consequence concerns bounded sesquilinear forms on H, i.e., the forms f 
to which a positive c exists such that |f(z,y)| < cllz|l |ly|| for all c,y € H. Using 
elementary properties of the norm together with Riesz’s lemma, we get 


3.1.1 Proposition: There is a one-to-one correspondence between bounded ope- 
rators and bounded sesquilinear forms; it relates B € B(H) to fp: fa(z,y) = 


(x, By). 


For any B € B(H), we can define the form ff by fp(x,y) := fa(y,x). Since 
it is again sesquilinear and bounded, there is just one operator B* € B(H) such 
that f%(r,y) = (a, B*y), in other words, 


(y, Bx) = (B*y,z) (3.1) 


for all x,y € H; we call it the adjoint to B. 


3.1.2 Theorem: (a) The map B+ B* is an antilinear isometry of B(H) onto 
itself, which fulfils (BC)* = C*B* and B** = B for any B,C € B(H). 


(b) If B is invertible the same is true for B* and (B*)~1 = (B-!)*. 


Proof: Antilinearity and the last two relations in (a) follow from (3.1) and Problem 3. 
Inserting z = B*y into (3.1), we get ||B*y|l? < |lyll Bll lB*yll, so |B*ll < Bl ; 
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replacing B by B* we prove B +> B* is an isometry. To check (b), notice that 
B-' is bounded and use B~'B = BB-! = I together with part (a). 


3.1.3 Example: To a given orthonormal basis € := {e;}#, in a separable H 
define Sp on Etim as a linear extension of Soe; := €;41, 7 = 1,2,.... It is bounded, 
\|Sox|| = |lzl| for any x € Ej, so its continuous extension S called the right-shift 
operator has ||S|| = 1. It follows from (3.1) that 


B23 — = eo. oe, 0 eee j=1 
mo 2aler$ es)eh = €;-1 ... otherwise 


and due to continuity, S* is fully determined by these relations; we call it the 
left-shift operator. 


3.1.4 Proposition: Ker B* = (Ran B)* for any Be B(H). 

Proof: Since B* is continuous, Ker B* is a closed subspace. If x € Ker B* and 
z = By, then (z,z) = (B*z,y) = 0 so Ker B* C (Ran B)*. On the other hand, 
z € (Ran B)* implies 0 = (z, By) = (B*z,y) forall ye H,ie., ze KerB*. & 


As an illustration of how the adjoint operator may be used, let us prove the following 
representation of bounded operators in L?(JR). 


3.1.5 Proposition: Let B € B(L?(IR)) ; then there is a function G: R? — € 
such that 


(a) the function G, := G(z,-) belongs to L?(JR) and Go = 0, 


(b) x++ (G;, f) is absolutely continuous in any finite interval [a,b] C R for all 
feL’(R), 


@) Ca) = a In G(x, y)f(y) dy for all f € L?UR) and almost all z € R, 


(d) G is essentially unique: suppose a function G:= IR? = € fulfils (a) and there 
is a total set D C L?(JR) such that (b),(c) are valid with G for all f € D, 
then G(z,y) = G(x, y) holds for all x € JR and almost all ye R. 


Proof: Let Jz be the interval with the endpoints 0,2. Setting e, := sgn (x)xy, , 
we have €) = 0, e, € L?(IR), so Gy := Be, satisfies (a) and 


(Gof) = (ee, Bf) = sen(z) ff (Bf)(y) dy 


for any f € L?(UR). The function z++ F(x) := f7(Bf)(y) dy is absolutely con- 
tinuous on any finite interval (a,b) and fulfils there (G,, f) = F(x) + const ; hence 
(b) is valid and 


(BAe) = Fa) = FG. N= Sf cewserdy 
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for almost all zx € (a,b), — therefore a.e. in JR. Finally, consider a function G 
specified in part (d). Then £(G:, j= £(Gz, f) for almost all x € JR and any 
f € D; in view of the aovalate continuity and the identity Go = Go = 0, we have 
(C.f)= (G,,f) for all x € R, f € D, so G, and G, represent the same vector 
in [?UR). § 


3.1.6 Example (a functional realization of the FP-operator on L?(JR) ): In the 
same way as in Example 1.5.6 we can check that 


(F*h) (@) = (2n)-? / eh(y) dy (3.2) 


for any h € L?™L'. Moreover, we have (f, Fg) = (FF-'f, Fg) = (F-'f,g) for 
any f,g € L?, so F* = F~!. Notice that this relation and the formula (3.2) hold 
in L?(R") as well; we have only to replace (27)~1/? by (21)-"/? and e*¥ by e”¥ 
where x-y means the inner product in JR”. 
Since e; € L? L', the functions G and G* referring to F* = F-! are easily 
found: the operators F,F~! act at any f € L?(JR) as 
e7 ity 


(Ne) = On ae fy ay HO 


— flu) ay. 


The functions G,G* here are complex conjugated, caret is not true in general for 
the representation of a pair B, B* — cf. Problem 6. 


(FA) (@) = @nyr Pt f = 


IR 


Next we show that the notion of a matrix representation known from linear 
algebra can be extended to bounded operators. We shall assume that is separable 
and shall choose an orthonormal basis € = {e;}72,; C H. To any B € B(H), we 
define an infinite matrix by 


By = (e;, Bex) , 9,k=1,2,... 5 (3.3) 


its k-th column is formed by the Fourier coefficients of the vector Be, , so Parseval’s 
identity gives )772, |Byx|? < co, k=1,2,.... The same argument applied to B*e, 
yields 752 [Bes <o,k= ip ish Furthermore, we can express the relation 
between the Fourier coefficients of x ant Bx by 


(e;, Bx) = 2 Bjx(€k, 2) , 


where the absolute convergence of the series is checked easily using the Hélder in- 
equality. In this way, we have defined a mapping of B(H) onto the set of infinite 
matrices with square summable rows and columns, which is obviously linear and 
satisfies the standard matrix rules 


(BC). = > BnCn, (B), = Bij ; 
i 
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we speak about the matrix representation of the operator B with respect to the 
basis €. 

On the other hand, a matrix (B;,) is said to represent an operator B if there 
is a separable Hilbert space with an orthonormal basis € such that the relations 
(3.3) hold. A finite matrix always represents a bounded operator on an appropriate 
finite-dimensional space, while in the general case the problem is slightly more 
complicated. 


3.1.7 Theorem: An infinite matrix (bk) 5 et represents a bounded operator iff 
the mapping 

co i oo 

(KY) FY ¥ bene) 

j=1 k=1 
where X = {&}, Y = {m} are any vectors of @*, is a bounded sesquilinear form 
on 
Proof: The necessary condition is simple. On the other hand, choosing Y; = {61 }52, 
we find from the boundedness of F that S°72, by€e, converges for any {&} € 
@, and therefore S772, |ba|? < oo. Hence we can pick a separable H with an 
orthonormal basis € = {e;} and define e, + fi := DR, beer. Denote by Bo 
the linear extension of this map. We can easily check that it is bounded, and 
its continuous extension to H is the sought operator B, which is by definition 
represented by the matrix (b;,). & 


The space B(H) can be equipped with topologies other than the topology 7, 
generated by the operator norm. Let us now mention two of them. Consider first 
the system of seminorms P, := { pz: x € H, pr(B) := ||Ba|| }, which is obviously 
separating points. The topology generated by P, is called the strong operator 
topology (we use the symbol 7, ) and the corresponding locally convex space will 
be denoted as B,(H) ; its local basis is formed by the sets 


V.(M) := {B € B(H): p(B) <¢,2EM} 


for all ¢ > 0 and any finite set M C H. A sequence {B,} converges in B,(H) 
to an operator B iff B,x — Br for all zx € H; we use the notation B, > B 
or B = s-limn.B,. The following results are easy consequences of the uniform 
boundedness principle. 


3.1.8 Proposition: (a) strong completeness of B(H): let {Bn} be a sequence 
such that {B,r} is Cauchy for any x € H ; then there is B € B(H) such 
that Br > B, 


(b) sequential continuity of multiplication: if By + B and C, > C ; then the 
product sequence B,C, —> BC. 


In distinction to the operator-norm topology, however, assertion (b) does not imply 
continuity of the multiplication (understood as a map B,(H) x B,(H) > B,(H)). 
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The reason is that the topological space B,(H) is not first countable with the 
exception of the trivial case of a finite-dimensional H. 


3.1.9 Theorem: If dimH = oo, the operator multiplication is not continuous 
on. By (7). 

Proof: We may assume that 1 is separable and € = {e,;}%, is an orthonormal 
basis; it is sufficient to find for any neighborhoods V, := V.,(M,),r = 1,2, a 
pair of operators B, € V, such that B,By ¢ Vij2({x}), where x € H is a unit 
vector. Consider the shift operators of Example 3: to any 6 >0,n=1,2,..., we 
define Bus = $(S*)” , Cag := 6S". The operator BrsgCng = I does not belong 
to Vijo({x}) for any n,6. Choosing 6 = 62 := €2(2 maxyem, |lyl| )~!, we have 
Cn. € Vo for any natural number n. Moreover, (S*)” —> 0, so we can find a 
natural number n; (depending on €1,62,.Mj,) such that || (S*)"* z|| < €16 for all 
z € M,, and therefore Bn, 5, € Vi. 


It is easy to check that the two topologies on B(H) are related by 7, D 7,. The 
proved theorem tells us, in particular, that the inclusion is nontrivial. This can be 
illustrated in an even easier way (Problem 8). 

Let us now pass to the weak operator topology 7, on B(H). It is generated 
by the system of seminorms Py := { Dry: 2,y € H, pry(B) := |(y, Bx)|}, which 
again separates points. The corresponding local basis is formed by 


W.(M, N) := {B € B(H) : py(B) <e,r€M,yeEN} 


and the locally convex space (B(H), 7) will be denoted as B,(H). The weak 
operator convergence of a sequence {B,} to an operator B is therefore equivalent 
to (y,Bnx) — (y, Bx) for all x,y € H ; we use the notation B, + B or B= 
w-limn_..B,. The space B(H) is also weakly complete. 


3.1.10 Proposition: Let {B,} C B(H) be a sequence such that {(y, Brx)} is 
Cauchy for any 2,y € H; then there isa B € B(H) such that B, > B. 

Proof: The weak completeness of # implies that B,x converges for any x € 
H to some z(r) = Bx. Since the B, are linear operators, B is linear also. 
The sequence {B,x} is weakly convergent and therefore bounded; the uniform 
boundedness principle then implies ||B,|| < c for some c and n = 1,2,.... We 
have |(B,x, Bz)| < cl|z|| ||Ba|| and |(B,x,Bx)| > ||Ba||?,so BE B(H). I 


On the other hand, the multiplication in B,,(H) is not even sequentially continuous 
(Problem 9). In general, the nontrivial inclusion 7, C 7, is valid (Problems 9,10); 
more about topologies on B(H) will be said in Section 6.3. 


3.2 Hermitean operators 


An operator A € B(H) is said to be Hermitean if A = A*; by properties of the 
inner product this is equivalent to the condition (x, Ar) € JR for all  € H. The 
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set of all Hermitean operators forms a real subspace in B(H) ; however, it is not 
closed with respect to multiplication. The analogy between Hermitean operators 
and real numbers inspires the decomposition B = ReB + iImB with 


] 
nape 5(B +B), ImB := —(B-B") 


valid for any B € B(H). 

The numerical range of a linear operator T : H — H with the domain Dr 
is defined as Q(T) := {(z,Tx) : x € Dr, ||z|}| = 1}. A bounded operator A is 
therefore Hermitean iff @(A) C R; the numbers 


ma := inf@(A), Ma, = sup Q(A) 


are its lower and upper bound, respectively. They provide us with an alternative 
definition for the operator norm. 


3.2.1 Proposition: The relations ||Al]| = max(|mal|,|Ma|) = supyqyoy |(z, Ar)| 
hold for any Hermitean operator A. 

Proof: Call the right side c4. The last identity as well as the inequality c4 < || All 
verify trivially. For a Hermitean A, the polarization formula gives Re(y, Ar) = 
4(qa(z+y) — qa(a—y)), where ga(z) := (z, Az), and therefore 


IRe(y, Az) < 5a (lle+ull? + lke—wll®) = 5ca (lel? + lal?) 


for all x,y € H. In particular, if x is a unit vector with Ax 4 0, then one has to 
choose y = Az/||Az|| to get ||Az||<c4. JW 


3.2.2 Example: Consider L?(a,b) with b—a < oo and define the operator Q by 
(Qf)(x) := xf(x) for any f € L*(a,b). It is Hermitean with mg = a, Mo = b, 
and ||Q|| = max(|al, |d}). 


An operator A € B(H) whose numerical range is such that m4 >0 (my, > 0) 
is called positive (strictly positive); we use the symbols A > 0 and A > 0, 
respectively. Positive operators satisfy the Schwarz inequality, 


| (y, Ax) |? < (x, Ax)(y, Ay) . (3.4) 


3.2.3 Example: For an arbitrary B € B(H) we have (x, B* Br) = ||Bz||? >0,s0 
B*B>0. Then ||B*B\| = Mg-g and Proposition 1 implies 


|B Bi] = BI) ; (3.5) 


this equality plays an important role in the theory of operator algebras — cf. Sec.6.1. 
In particular, fora Hermitean A we get by induction {| A?” || = ||A||?”, so its spectral 
radius equals 

r(A) = Jim A?" = [Al 
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The notion of positivity allows us to introduce inequalities between Hermitean 
operators: we write A > B if the operator A—B is positive. Operator inequalities 
can be manipulated using simple rules analogous to those of numerical inequalities; 
we leave this to the reader. It is also clear that the norm is nondecreasing on the 
set of positive operators: A > B > 0 implies ||Al| > ||Bll. 

Operator inequalities can be also used to define the supremum and infimum 
for an arbitrary set { A, : a € I} of Hermitean operators: A‘) := sup,e; Aa is a 
Hermitean operator such that (i) A, < A‘) forany a € J, (ii) if B isa Hermitean 
operator such that A, < B for all a € J, then A) < B. Ina similar way, we 
define the infimum. The supremum and infimum may not exist, in general, because 
< defines only a partial ordering on the set of Hermitean operators. Existence is 
guaranteed, however, for fully ordered subsets. 


3.2.4 Theorem: Let {A,} be a nondecreasing sequence of Hermitean operators, 
and B be a Hermitean operator such that A, < B,n=1,2,.... Then there is a 
Hermitean A :=s-limp..An, and sup, Ap, = A. 

Proof: Denote fn({xz) := (z,Anx) for any « € H. The operator Amn := Am — An 
is positive for m > 7n, and inserting A= Amn, y = Amn into (3.4) we get 


Aran ll* < |fra(e) — fa(t)| (Arnis Aran) 


Norm monotonicity yields (Amn2,A7,0) < ||Amnl{3|lx|l? < ||B — Aill*llzl]? and 
since {f,(x)} is a nondecreasing sequence bounded by (z, Bz) , the sequence {A,r} 
is Cauchy, and by strong completeness, {Ap} converges strongly to some A € B(H). 
Then fr(z) — (x, Ax) so (x, Ax) = sup,, fn(x) for all 2 € H; this means that A 
is Hermitean and equal to sup, An. IU 


3.2.5 Remark: In the same way, s-limp4oAn = infp A, holds for any nonin- 
creasing below bounded sequence {A,}. Let us stress that the limit of {A,} with 
respect to operator—norm topology may not exist — cf. Problem 8. 


As an illustration to the above theorem, let us show how it can be used to 
construct for a positive A a positive operator B satisfying B? = A; it is natural 
to call the latter the square root of A and denote it as VA. 


3.2.6 Proposition: To any positive operator A there is a unique square root. 
Morever, the conditions AB = BA and VAB = BVA are equivalent for any 
Be B(H). 

Proof: Since the case A = 0 is trivial we may consider positive operators A with 
||Al| = 1 only. It follows from (3.5) that ||\VAl| =1,s0 0< A<J and O0< VA< 
I. Defining X :-=1—A,Y:=1-—WA we rewrite the condition VA? = A as the 
equation Y = 4(X + Y*), which can be solved by iteration, 


1 =e 2 = 
Y= 5X, You = 5 (X+¥n) for n=1,2,.... (3.6) 


Each Y,, is then a polynomial in X with positive coefficients so, in particular, 
{Y, : n = 1,2,...} is a commutative set. We have Yaii — Yn = 4 (Yn — Yn-1) 
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(Yn + Yn-1) and another induction argument shows that the differences Yn+1 — Yn 
are again polynomials in X with positive coefficients; hence {Y,,} is nondecreasing. 
It also follows from (3.6) that Y, < J for all n. Theorem 4 then implies the 
existence of a positive s-limpoo Yn =: Y < J, and performing the limit in (3.6) we 
get the sought solution, /A=I-Y. 

If B commutes with A, it commutes with X too, so BY, =Y,B. Taking the 
limit we get BVA = VAB. The opposite implication is trivial, so it remains to 
check the uniqueness. Suppose there are positive B,C such that B? = C? = A; 
we already know that this means BY, = Y,B. Set D:= B—C and y= Dz for 
any x € H. Then (y, By) + (y,Cy) = (y,(B? — C?)x) = 0, and since B,C are 
positive, we get (y, By) = (y, Cy) = 0. It follows further that By = VCy =0, 
so Dy=0 and ||Dz||? = (Dy,z)=0,ie, D=0. § 


The notion of a square root allows us to associate with any B € B(H) a positive 


operator defined by 
IB) = vB 


which has some properties similar to the complex—number modulus. However, the 
analogy should be used with a caution (Problems 14, 15). 

One of most important classes of positive operators consists of projections. 
Let G be a closed subspace in H; for any x € H we denote by y, its orthogonal 
projection to G. It follows from Theorem 2.1.2 that the map z+ y, is a linear 
operator; we denote it as Eg. By definition, Eg is bounded, Ran Eg = (Ker Eg)+ = 
G and ||Eg|| = 1 provided G 4 {0}. Moreover I — Eg is the projection to the 
subspace G+, so Eg+Egi =I. 


3.2.7 Examples: (a) Let € = {e1,...,e,} be an orthonormal set in H. The 
projection E™ to G:= Ejm acts as Es = Lia (ey, Bey. 


(b) The subspace G := { f: f(x) =0 fora.a. x >t} C L?(R) is closed and the 
corresponding projection F; acts as 


w= {NOD 25 


3.2.8 Theorem: An operator F defined on the whole # is a projection iff it 
is Hermitean and E? = E. Any projection E is positive, Ran FE is closed and 
consists exactly of the vectors which satisfy Ex =z. 


Proof: If E is a projection, the identity E? = E follows from the definition, 
and ((IJ — E)z, Ex) = 0 gives (a, Ex) = ||Ez|? € R, i.e., Hermiticity. On the 
other hand, let E be Hermitean and fulfil E? = E. For any y € Rank, y = 
liMpoo in, we have By = limpioo E*2_ = limps Etn = y due to continuity, so 
Ran E is closed and its elements fulfil Ey = y. The relation x = Ex+(I-E)zx 
defines its orthogonal decomposition for any x € H; hence E is a projection. §§ 
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Projections E, F are said to be mutually orthogonal if Ran L Ran F’; this is 
equivalent to the condition EF = FE =0. The following simple relations between 
projections are often useful. 


3.2.9 Proposition: Let E,F be projections on H. 


(a) E+F isa projection iff FE, F are mutually orthogonal; in that case we have 
Ran(£ + F) = RanE@RanF, 


(b) the following conditions are equivalent: 


(i) E -—F is a projection, 
(ii) E> F, 

(iii) Ran EZ > RanF, 

(iv) beer r= F: 


if these are satisfied, Ran (E£—F) is the orthogonal complement to Ran F’ in 
Ran £ , 


(c) EF is a projection if EF =FE;then RanEF =RanE 1 RanF. 


3.2.10 Remark: Let us denote € := RanE, F := RanF’. If the projections 
commute, their product projects to the subspace ENF. In the general case, ENF 
is still a closed subspace, and we can ask whether the corresponding projection G 
can be expressed with the help of & and F’. The sought formula reads 


G = s-lim(EFE)" = s-lim(EF)" (3.7) 


or the same with E and F interchanged. To prove this, we first show that the 
operators G, := (EFE)” fulfil 0 < Gayi < Gn < I so G:=s-limn.~. Gr exists 
by Theorem 4 and 0 < G < I ; we have G2 = Ga, so G? = G and G is 
a projection. Performing the limit in EG, = GnrnE = Gy, we get RanG Cc €E ; 
similarly GnF Gn = Gon+ gives the relation GF'G = G which implies || FG—G||? = 
(FG — G)*(FG — G)|| = ||G - GFG|| = 0, and therefore RanG Cc F. On the 
other hand, G,x =z for any x € EMF, which yields Gr = x ; together we get 
RanG =€NF. The second identity in (3.7) follows from FG=GF=G. 


3.2.11 Proposition: Any projections E,F ona given H satisfy ||E— FI <1. 
If || — F|| <1, then dim Ran E = dimRanF and dim Ker EF = dimKerF. 
Proof: Since the vectors E(J — F)x and (E—I)Fzx are orthogonal for any x € H, 
we have ||(E—F')z|? = ||E(I—F)2||?+||(£-DFa\)? < ||U-F)z|?+||Fx|? = |a|l?. 
The remaining assertion follows from Problem 18. §f 


Proposition 9 generalizes easily to any finite system of projections; in combination 
with Theorem 4 it gives the following result for sequences. 
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3.2.12 Theorem: To any monotonic sequence {E£,,}°2, of projections there is 
a projection E := s-lim,;...En. Its range satisfies Ran FE = (ZL, Ran E, if the 
sequence is nonincreasing, and Ran F = Ur2, Ran £,, if it is nondecreasing. 


Let {E™}22, be a nondecreasing sequence of projections; we set Ey := EB 
and EBay = — 8 , n =1)2,...; them {B; »g=— 2,...} a serroiam 
tually orthogonal projections due to Proposition 9. Since Ran EB! = j=, Ran EB; 
and the algebraic sum of subspaces )(72. Ran E; equals Uf2, Ran E\) | we see that 


1 s-lim > EB; =: Se E; (3.8) 
j=l j=l 


projects to the orthogonal sum @f2, Ran E;. This result generalizes to any system 
{ E,: a € I} of mutually orthogonal projections. We denote by E the projection 
to the subspace °°, Ran E, (Problem 2.14) and define ae; Ea := E. In parti- 
cular, {F,: a € I} is said to be a complete system of projections if E =I. The 
relation (3.8) and Theorem 12 have the following generalization. 


3.2.13 Proposition: Let {Fa : a € I} be a system of mutually orthogonal 
projections; then 

>> E, = slim Ex = sup Ex, 

aél 2 $ 

i.e, Ex = lims Exx for any zx € H, where S := {K € I! : K finite} and 
Ex := DaeK Bx. 

Proof: To any x € H and ¢ > 0 there is an at most countable {a,} such that 
Ex = 0 for a ¢ {ax} and ||Ex — SR, Ea, 2|| < ¢€ forall n > n(e,r). The family 
S is obviously directed, so there isa aset K €S such that K D {a1,...,Qner)}. 
This means that K M {ax} = {a1,...,Qn,}, where nx > n(e,x), and therefore 
|x — Exa|| < ¢. Furthermore, Ex < E and (2, Ex) = lims(z, Exx), so the 
second identity holds too. 


3.3. Unitary and isometric operators 


An isomorphism U : H — G between two Hilbert spaces is often called a unitary 
operator; this means that it maps surjectively H onto G and ||Uz||¢ = ||zl\x, for 
all x € H. Using the appropriate definitions, we easily check 


3.3.1 Proposition: An operator U € B(H,G) is unitary iff it is surjective and 
(Ux, Uy)g = (x,y) for all x,y e H. If G =H, this is further equivalent to the 
following: U is linear, everywhere defined, and U~} = U*. 


3.3.2 Example (substitution operators): Consider a continuous bijective mapping 
yp: IR” — IR” which satisfies the following conditions: 


(i) each component y; : IR” — MR of y has continuous partial derivatives 
OND}, =, .. ay, 
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(ii) there are positive numbers a,b such that the Jacobian D, := det(O,;) fulfils 
b-! < |D,(x)| <a for all re R". 


The set of all such bijections will be denoted as ®. The substitution theorem implies 
aes 2 
[fn @Pdr =f \f(o@))PIDp(a)| ax 


for any f € L*(JR"), and we readily check that y~! belongs again to ®. Define 
now the operator U, by 


(Uof) (2) = |Dy(2)|"? (foe) (a) ; 


it is a product of the operator of multiplication by |D,(-)|!/?, which is bounded 
and invertible by assumption, and of the “substitution” operator S,: S,f = foy. 
The latter is also bounded since ||f||? > }||S,f||?, and moreover, S,S,-1 = I, 
so Ran S, = L?(JR") and the same is true for RanU,. Finally, the substitution 
relation shows that U, preserves the norm, ||U,f||? = ||f||?, and therefore it is 
unitary. As a particular example, let us mention the reflection (or parity) operator 
R: (Rf)(x) = f(—zx) for all x € IR”. The definition relation implies R? = I, i.e., 
R-! = R, which means the operator R is simultaneously unitary and Hermitean 
(see Problem 20). 


To check that a given operator U is unitary it is often enough to verify the 
definition properties on a subset of # only. 


3.3.3 Proposition: (a) Let Up be a densely defined linear operator on H with 
Ran Up = H such that ||Uox|| = ||x|| for all 2 € D(Uo) ; then it has just one 
unitary extension. 


(b) Let M CH be a total linearly independent subset. If V : M — H is such 
that VM is again total and (Vz,Vy) = (x,y) for all x,y € M, then there 
is just one unitary U fulfilling U)/M=V. 


Proof: The existence of a unique extension U € B(H), ||U|| = 1, follows from 
Theorem 1.5.5. By assumption, for any y € H we can find a sequence {x,} C H 
such that Up, — y. Since Up is norm preserving, {z,} is Cauchy, so it converges 
tosome x and y = limgsoVotn = Ux, i.e, RanU =H. Furthermore, ||Uz|| = 
limpoo {|Uotn|| = limp ||Znl| = ||z||, which proves (a). Let Up be the linear 
extension of V. The subspaces Min, and (VM)yn, are dense by assumption, and 
inner-product linearity yields ||Uoz||? = ||z||?, so part (a) applies. 


A linear operator V on # is called isometric if D(V), RanV are closed 
subspaces and V is unitary as a map D(V) — RanV ; it is clear that V is a 
unitary operator on H iff D(V) = RanV =H. If D(V) # H we can extend V to 
an operator W € B(H) called a partial isometry such that Vw := W } (Ker W)+ 
is isometric. The extension is clearly unique and obtained by setting Wx := 0 for 
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x € D(V)+. The ranges of W and Vw coincide, so Ran W is closed; we call it the 
final subspace of W. On the other hand, (KerW)* is the initial subspace of W . 

Any unitary operator is a partial isometry. Other examples are projections, shift 
operators S,5*, or the operators P,, of Problem 6 ; the initial and final subspaces 
in all these cases are easily found. An inverse operator to W does not exist unless 
Ker W = {0} ; however, we can replace it by W* in the following sense. 


3.3.4 Proposition: The adjoint to a partial isometry W is also a partial isometry, 
with the inital and final subspaces interchanged, and the relation Vw- = Viv’ holds 
between the corresponding isometric operators. The operators W*W and WW* 
are projections to the initial and final subspace of W , respectively. 


Proof: Denote by E,,E; the projections to (KerW)+, RanW, respectively. We 
have (Wz, y) = (Viv Bix, Byy) = (Bic, Vig Byy) = (a, Vip'Byy) for any 2,y € H, 
so W* = V,,'E;. Furthermore, (KerW*)+ = RanW = EsH, so W*} (Ker W*)+ 
= V,,'. Hence the first assertion is valid, and it is sufficient to check the second one 
for W*W. We have W*Wa = W*VwEyz = Vy 'Vw Bia = Exx for any z € H, so 
W*W=E,;. & 


In conclusion we shall prove one more theorem inspired by the analogy between 
bounded operators and complex numbers (see also Problem 24). 


3.3.5 Theorem (polar decomposition for bounded operators): To any B € B(H) 
there is just one partial isometry Wg such that B = Wp|B| and Ker Wg = Ker B. 
Furthermore, the identity Ran Wg = Ran B holds. 


Proof: Define Wy : Ran|B| — RanB by Wo|Bl|z := Br. Since |B| need not 
be invertible, we have to check whether Wo is well defined (which would imply 
its linearity). This follows from the relation ||Bz|| = |||B\x|]| (Problem 14), which 
further implies that Wo is norm preserving. Its continuous extension is an iso- 
metric operator that maps Ran|B| = (Ker|B|)+ = (Ker B)+ to Ran B, and the 
corresponding partial isometry Wg satisfies Wg|B\lz = Ba for all x € H. To 
prove uniqueness, suppose there is a partial isometry W with the needed properties 
so that (Wg — W)|B| = 0. Using the orthogonal decomposition x = y+z with 
y € KerB and z € (KerB)+ = Ran|B| we find (Wg — W)z = (Wg —-W)z = 
limpoo(Ws — W)|Blun, where |Blun + z. By assumption, (Wg — W)|Blu, = 0, 
so (Wg—W)xr=0 foranyzeEH. §f 


3.4 Spectra of bounded normal operators 


We know from linear algebra that a linear operator B on a finite-dimensional H 
has an orthonormal basis of eigenvectors iff B commutes with B*. Now we are 
going to generalize this result to bounded linear operators on an infinite-dimensional 
Hilbert space. An operator B € B(H) is dubbed normal if BB* = B*B. The set 
of all bounded normal operators on H will be denoted M(H); it obviously contains 
the real subspace of Hermitean operators and the group U(H) of unitary operators 
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but the whole set M(H) is neither a subspace nor a group. We start from simple 
modifications of the definition. 


3.4.1 Proposition: The following conditions are equivalent: 
(a) BE N(H), 
(b) BYE N(H), 
(c) the operators Re B, Im B commute, 
(d) ||(B —A)z|| = ||(B* — A)z|| for all ce H, AEC. 


The last condition implies that A € o,(B) iff  € op(B*), and moreover, the 
corresponding eigenspaces Ker(B—.) =: Np(A) satisfy Ng-(A) = Np(A). In 
particular, o,(B*) = iff o,(B)=90. 


3.4.2 Theorem: Let Be N(H) ; then 
(a) if 4, are different eigenvalues of B, one has Ng(A) L Na(y), 


(b) if a subspace L C H is B-invariant, then L+ is B*-invariant. In particular, 
Na(A)+ is B-invariant for any \ € o,(B), 


(c) the residual spectrum of B is empty. 


Proof: Consider nonzero vectors x € Na(\), y € Na(u); we have Bty = fy, which 
together with Bx = Ax and (y, Bx) = (Bty,x) gives (A—y)(y, 2) = 0 proving (a). 
Consider further x € L+; by assumption By € L for any y € L, so (y, Btr) = 
(By,z) = 0, ie, Btx € L+. In particular, Ng-(A) = Ng(A) is B*-invariant, so 
Na(A)+ is B**-invariant. Finally, if \ € o,(B), i.e, Ran(B—A)+ ¥ {0}, then A 
is an eigenvalue of B* by Proposition 3.1.4; hence » € op(B) in contradiction with 
the assumption. §& 


3.4.3 Theorem: The resolvent set p(B) of any B € N(H) coincides with its 
regularity domain; in other words, A € p(B) iff there is a positive c = c(A) such 
that ||(B—A)z|| > cllz|| for all eH. 

Proof: If X € p(B), the above condition is satisfied for c = ||Rg(A)||7! since 
iiy|| > ||Re(A)||-? || Rea(A)yl| and any y € H can be written as y= (B—A)x. On 
the other hand, the condition implies Ker(B — ) = {0}, so (B — A)7! exists. 
Proposition 1 gives Ker (B* — 4) = {0}, so Ran(B— A) =; it remains to prove 
that Ran(B — A) is closed. Let a sequence {yn} C Ran(B-— 4) converge to y ; 
then there is {z,} C #H fulfilling y, = (B —A)zp, and the condition shows that 
{rn} is Cauchy. Setting x := limp %n, we have (B—A)x = y by continuity. 


3.4.4 Corollary: (a) Suppose that B € N(H), then » € o(B) iff inf{ ||(B—A)z|| : 
xz € S,;}=0, where S; is the unit sphere in H, 7.e., iff there is a sequence 
{an} of unit vectors such that (B—A)z, — 0. 
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(b) The spectra of unitary and Hermitean operators are subsets of the unit circle 
and the real axis, respectively. 


Proof: The first part is obvious. A unitary operator U fulfils 
\(U — Aol? = |Uall? + [AP ilzl|? — 2Re (Ux, Ax) > (1— Al)? lal’, 


so A € p(U) if |A| # 1 in view of the above theorem. In the same way, we 
obtain the relation ||(A—A)zl|? = ||(A—A1)2z||? +|A2|?\|z||? for a Hermitean A and 
= 1 +%2, which shows that o(A) C R. Gf 


The spectrum of a Hermitean operator can be further localized as follows (see 
also the notes to Section 3.2). 


3.4.5 Proposition: The spectrum of a Hermitean A is a subset of the interval 
{ma, Ma] C IR whose endpoints, i.e., the lower and upper bounds to A, belong to 
o(A). 

Proof: In view of the corollary, we have to check (—oo,m,) U (M4, 00) C p(A). If, 
for instance, \ € (My, oo) then the definition of M4 gives (z,(A—A)z) > A-— Ma 
for any unit vector x, so |\(A — A)z|| > |((A — A)z,z)| > A — Ma. Let us check 
further that m, € o(A). To this end, we apply the inequality (3.4) with |\z|| = 1 
and y = (A—my,)= to the positive operator A—m,. This yields ||(A—ma,)z||* < 
(x, (A—m,)zx)||A—ma|| ||(A—ma)z||?, so by definition of the lower bound we have 


inf (A— mada? < JA — mall (inf (Az) ~ ma) = 0. 


In the same way, we can check that M4, €o(A). ff 


3.4.6 Examples: (a) If & is a nontrivial projection, then mg =0 and Mg=1, 
and furthermore, we easily find ||(E—A)z||? > min{(1 — A)?, \?} |x|? for any 
A € (0,1), so o(F) = {0,1}. 


(b) Let @ be the multiplication operator of Example 3.2.2 and 2 € (a,b). We set 
fn c= Vnx1, , where I, = (a,b) Q, A+ 1) ; if n is large enough we have 
fri] = 1 and {\(Q — A)frll? = (3n?)-! 4 0. Since Q has no eigenvalues and 
or(Q) =@ by Theorem 2, we obtain o(Q) = o,(Q) = [a,}]. 


We have mentioned that an operator on a finite-dimensional H is normal iff 
it has an orthonormal basis of eigenvectors. If dim#H = oo this condition is no 
longer necessary; however, it is still sufficient. An operator B € B(H) will be said 
to have a pure point spectrum if its eigenvalues form an orthonormal basis; this 
definition is justified by the following result. 


3.4.7 Theorem: Any operator B € B(H) with a pure point spectrum is normal 
and o(B) =o,(B). 
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Proof: Let E = {e, : a € I} be an orthonormal basis such that Beg = Ac€a for 
any a € I. It follows easily that B*e, = \eg, so [B*, BIE = {0} and by linear 
and continuous extension, B is normal. Since o(B) is closed by definition and 
contains o,(B), it is sufficient to show that o(B) C o,(B). Suppose » ¢ op(B) 
so |A—Aaq| > 6 for some 6 > 0 and all a € I. For any x € H, there is at most 
countable I, := {a;} CI such that (eo,2) = (e€o,Bzr) =0 for a ¢ I, ; then 


(BA? = YF lea; (B— r)zx)? = So lag — AP H(Cagsa)[? > & Ila? 
j=1 j=l 


and A ¢o(B) in view of Theorem 3. 


3.5 Compact operators 


A linear everywhere defined operator C on H is said to be compact if it maps 
any bounded subset of H to a precompact set. Since H is a metric space, this is 
equivalent to the requirement that any bounded {z,} C H contains a subsequence 
{n,} such that {Czxn,} converges. The set of all compact operators is denoted as 
K(H) ; alternatively we use the symbol J..(H) or Jo... A compact operator cannot 
be unbounded since any precompact set is completely bounded by Corollary 1.3.7a, 
and therefore bounded, so we have 


K(H) c B(H). 


If dim#H = oo the inclusion is nontrivial, as the unit operator or-a projection to 
an infinite-dimensional subspace of H illustrate. On the other hand, a bounded 
operator is always compact if it is finite-dimensional, i.e., dim Ran B < oo since 
BM is a bounded subset of Ran B for any bounded M C H, and is therefore 
precompact (Corollary 1.3.7a). In particular, the above inclusion turns into identity 
on a finite-dimensional H. 


3.5.1 Theorem: An operator is compact iff it maps any weakly convergent sequ- 
ence to a convergent one. 

Proof: Suppose that C € K(H) and zp + 2x does not imply Cz, + Cx. Then 
there is a positive € and a growing sequence {n,} of natural numbers such that 
|Cz — y|| > € holds for y, := Crp, ,k = 1,2,.... In view of Theorem 1.5.12b, 
{Zn,} is bounded, so one can select from {y,} a subsequence {y,,} which con- 
verges to some y. This means, in particular, y,, — y. On the other hand, 
Cz, — Cx since C’ is bounded, so together we get y = Cx in contradiction 
with the assumption. The opposite implication follows from the reflexivity of H 
(cf. Theorem 1.5.14b): one can select a weakly convergent subsequence {zp,} from 
any bounded {zn} C H, and {Czp,} converges by assumption. 


3.5.2 Theorem: K(H) is a closed subspace in B(H) with respect to the operator 
norm topology. If C € K(H), then its adjoint C* is compact, as are the operators 
BC and CB for any bounded B. 
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Proof: The vector structure of K(H) and compactness of BC, CB follow from 
the previous theorem. If a sequence {z,} converges weakly to some z, we have 
\|C*x_ — C*x||? < |lz_q —2| ||CC*(an — x)|| ; however, {zp} is bounded and CC” is 
compact in view of the just proved result, so CC*z, — CC*x. This implies that 
|C*x, — C*x|| — 0, which means C* is compact too. 

It remains to prove that K(H) is closed. Suppose that |/C, — Bl| — 0 for 
{C,} C K(H) and B € B(H), and let M C H be bounded, ||z|| < K for all 
xz &€ M. We need to find a finite e-lattice to the set BM for any e > 0. There 
is n(e) such that ||C,-) — Bl] < gs, and since C,e) is compact, there is a finite 
§-lattice N for Cre)M, t.e., we can find y € N such that ||Cyez — y|| < § for 
any z € M. Then ||Br— yl] < ||B — Cyey|l zl] + |Caez — yll < €, so N is the 
sought €-lattice. §§ 


3.5.3 Remark: Referring to Appendix B.1 for the appropriate definitions, we can 
state the above theorem concisely: K(H) is a closed two-sided *-ideal in B(H#). 
We shall use this algebraic terminology also in the following section. 


3.5.4 Corollary: If C is a compact operator on an infinite-dimensional Hilbert 
space H, then 0 € o(C). 


Proof: A bounded inverse C~! does not exist, since otherwise C7'C = I would be 
compact. & 


Finite-dimensional operators are more than a simple example of compact oper- 
ators. They allow us to approximate them, which is extremely important, e.g., in 
practical solutions of operator equations. 


3.5.5 Theorem: An operator B € B(H) is compact iff there is a sequence {B,} 
of finite-dimensional operators converging to it with respect to the operator norm. 


Proof: Let C € K(H) with dimRanC = oo and denote by {e,;}52, an or- 
thonormal basis in EH := RanC (cf. Problem 33). Let E, be the projection 
to {€1,...,€n}in, then E, ++ E and E,C > EC (Problem 32), so {E,C} is the 
sought sequence. The opposite implication follows from the operator-norm closed- 
nessof K(H). & 


Next we are going to show what the spectrum of a compact operator looks like. 
We start from an auxiliary result. 


3.5.6 Lemma: Ran(C — 4) is closed for a compact C and any nonzero \€ C. 


Proof: Let {an} C (Ker (C—A))+ be a sequence such that (C—A)ta =: yn — y. If 
{xn} is bounded, then due to compactness there is a subsequence {2p,} such that 
{Czn,} is convergent, and therefore x,, = {(C2n,—Yn,) converges to some x € H. 
Then y = limpoo(C — A)tn, = (C — A)x, so y € Ran (C — X). It remains to check 
that {z,} is bounded. Suppose that there is {x,,} such that limp.» ||zn, || = 00. 
We set 2p := In, /||2n,||. There is again a subsequence {2,1} such that {Czxqy} 
converges, and in the same way as above, we find that zu) — z € (Ker(C — A))+ 
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and |{z|| = 1. However, this is in contradiction with 


(C —pame=diien (C eae jim [latmx "Yn =O. Oe 


3.5.7 Theorem (Riesz—Schauder): If C is a compact operator on H, then 
(a) any nonzero point of the spectrum is an eigenvalue, o,(C) \ {0} = o(C) \ {0}, 


(b) any nonzero eigenvalue has a finite multiplicity, 
(c) the spectrum accumulates at most at zero, 


(d) the set of eigenvalues is at most countable, o,(C) = {Aj : 7 =1,...,N} with 
N < oo. The eigenvalues can be ordered, |A;| > |Aj41|, and lim; A; = 0 
if N =oo. 


Proof: Assume that o(C) contains a nonzero » which is not an eigenvalue so the 
inverse (C — \)~! exists and Ran(C — A) # H. We define Gn := (C— A)"H, n= 
0,1,...; then (C—A)"—(-—A)” is for n > 1 a polynomial in C' without an absolute 
term, so it is compact and G, is closed by Lemma 6. Furthermore, G41 C Gp and 
Gn+i #Gn for any n, which follows from G, = Ran(C— A) #H and the existence 
of (C—A)-". Hence there are unit vectors 2p € GaiNGt+,n=1,2,..., and {rn} 
is an orthonormal set, so zn — 0 and Czp, — 0. Now we take the decomposition 
Crtn = AXn + (C — r)Zn, where (C — A)tn € Gr is orthogonal to x, ; it means 
|Cap|| > |A| |lvn|| = A]. This is in contradiction with Cz, — 0 thus proving (a). 

Let » € o,(C) \ {0}. Since C’ acts as a \-multiple of the unit operator on 
the corresponding subspace, we obtain (b). To prove (c), we have in view of (a) to 
check that the eigenvalues cannot accumulate at a nonzero 4. Suppose that there is 
{An} C op(C) such that An # Am for n#m and A», — A. The corresponding set 
of eigenvectors {rz, : n = 1,2,...} is therefore linearly independent; applying the 
orthogonalization procedure we get an orthonormal set {y, : n = 1,2,...} such 
that (C—A)yn € {y1,---5Yn}un- Then Cyn = AnYnt+(C—An)¥n is an orthonormal 
decomposition, so |/Cyn|| > |An| — |Al. However, this contradicts the condition 
Cyn — 0, which follows from the orthonormality of {yn} and the compactness of 
the operator C’. 

Consider finally the sets M, := {A € o,(C) : |A} > n7!}. Due to (c), they 
have no accumulation points, and since they are bounded, they must be finite. 
Hence o,(C) \ {0} = U%, M, is at most countable; if it is infinite, we have 
limjsoo Aj = 0 for any numbering of the eigenvalues. Passing to the disjoint sets 
Ny := My, Ningi = Mmai \ Mn, = 1,2,..., we see that the eigenvalues may be 
numbered so that |A;| > |Ajzi!. Wf 


3.5.8 Corollary (Fredholm alternative): Consider the equation 
z-—-ACr=y 


for a compact operator C’, a complex A, and y € H. One and only one of the 
following possibilities is valid: 
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(i) the equation has a unique solution x, for any y € H, in particular, zo = 0, 
(ii) the equation with a zero right side has a nontrivial solution. 


Proof: Everything follows from part (a) of the theorem: if A~' ¢ o(C), then 
ty = —\71(C — d7")y ; in the opposite case A~' is an eigenvalue and any of the 
corresponding eigenvectors solves the equation with y=0. & 


For compact operators, a converse to Theorem 3.4.7 is true. 


3.5.9 Theorem (Hilbert-Schmidt): Any normal compact operator has a pure point 
spectrum. 


Proof: Let {An : n= 1,...,N} be the set of nonzero eigenvalues of a normal oper- 
ator B, L, the corresponding eigenspaces, and m, = dim Ly < oo the multiplicity 
of An. The subspaces L, are mutually orthogonal and contained in (Ker B)+. We 
have to check that G := @%2, Ln equals (Ker B)*, since the sought orthonormal 
basis will be then obtained as a union of bases in L, and a basis in KerB. 

By definition, G C (Ker B)+. Each of the subspaces L, is simultaneously an 
eigenspace of B* corresponding to A, ; since the set {An} is bounded, the subspace 
G is B*-invariant. Using Theorem 3.4.2 again we see that G+ is B-invariant. Hence 
B}G+ is a normal compact operator with no nonzero eigenvalues, and therefore 
B}G+ =0 (Problem 34). In other words, G¢ Cc KerB,so GD(KerB)+. § 


With Theorem 7 in mind, we shall always assume that the eigenvalues of a nor- 
mal compact operator B are ordered, |An| > |Andi|. Denoting Jg := dim(Ker B)-, 
we can construct the sequence {A(j pas in which any nonzero eigenvalue is repeated 
as many times as its multiplicity is, and |A(j)| > |AY + 1)|,7 = 1,...,Jg. The 
corresponding eigenvectors which form an orthonormal basis in (Ker B)+ can be 
numbered so that 

Be; = A(j)e; ’ j= 132,..... age 
Using this notation, we can derive a useful expression of the eigenvalues representing 
a modification of the minimaz principle, which we shall discuss in Section 14.2. 


3.5.10 Corollary: We have 
AG)| = min €2(G)), F=1,2,..-5Jp 
where G; C H is any subspace of dimension < 7 — 1, and 
€a(Gj) = sup { ||Bal| : xe G+, ||xl| =1} 


(the supremum can be replaced by maximum — cf. Problem 37). In particular, 


|A(1)] = [BIL 


Proof: Denote F; := {e1,...,€;}in. In view of Problem 18, there is a unit vector 
y € F;G} for which we get 


én(G;)? > ||Byll? = 


Yen Bee] = S21) (enw > AG). 
=] k=1 
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On the other hand, any unit vector + € F;L, satisfies 


Bal? = » (Brena)? = SRP Kewa)? < AGP, 


KF 


so ming, £2(9;) = €a(F;-1) =|AQy)|. 


The existence of a basis of eigenvectors allows us easily to express the action 
of a normal compact operator. It appears, moreover, that a similar formula can be 
written for any compact C. To derive it we use the fact that |C'|] is compact too 
(Problem 35), so 


Jo 
IClz = PE, H(J) (e5, 2) e; 


holds for any z € H, where Jo = dim(Ker|C|)+ = dim(KerC)+. The positive 
numbers p(j) = uc(j), 7-e., the nonzero eigenvalues of |C| are called singular 
values of the compact operator C. We assume they are ordered, p(j) > w(j +1), 
and |Cle; = u(j)e; for 7 = 1,...,Jc. Furthermore, using Problem 14 we get the 
following minimax-—type expression, 


yo(J) = min {sup{ Cal] : x € G7, lle] =1}} . (3.9) 


The vectors e; belong to (Ker C)+, which is the initial space of the partial isometry 
in the polar decomposition C = W|C|. Then the vectors f; := We;, 7 =1,...,dce, 
form an orthonormal basis in Ran C’; we can also write them as 


fy = wy *Ce;. 


Now applying W to the above expression of |C|z, we get the sought formula, 
Jo 
C= = uj) (e;, ‘) Si ’ (3.10) 
j=l 


which is called the canonical form of the compact operator C’. Notice that it 
has a practical importance. If Jc = oo, the partial sums of this series yield a 
sequence of finite-dimensional operators which is easily seen to converge to C’ in 
the operator norm. Hence we have a concrete form of the approximating sequence 
whose existence is ensured by Theorem 5. 


3.6 Hilbert—Schmidt and trace—class operators 


Now we are going to discuss two important classes of compact operators; to make 
the formulation easier, H in this section will always mean an infinite-dimensional 
separable Hilbert space. An operator B € B(H) is called Hilbert-Schmidt if 
there is an orthonormal basis Eg = {f;}$2; C H such that D§2, ||Bfj||? < 00. 
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The set of all Hilbert-Schmidt operators on a given # is denoted as J2(H) or 
simply J2. 


3.6.1 Proposition: Let B € Jo. Then B* € Jz, and furthermore 


» |lBe;l|? < 00 


j=l 


for any orthonormal basis € = {e;} C H, with the sum independent of €. 

Proof: We denote the sum of the series in question by N¢e(B). The Parseval relation 
gives ||Be;||? = x |(fe, Be;)|? = Xe \(B*fe,e;)|?, and since a series with non- 
negative terms may be rearranged, we get 


Ne(B) = y 2 \(e;, BY fe)? de |B* fell? = Neg(B’). 


Choosing first E = €g we find Ne,(B*) = Ne,(B) < co, so B* € Jy, and using 
the last identity, we get Ne(B) = Ne,(B) forany €. Of 


Since ,/N¢(B) does not depend on the choice of € , we may use it to define the 
Hilbert-Schmidt norm of an operator B by 


es 1/2 
|Blla = (x \Be* (3.11) 
j=l 


3.6.2 Theorem: J2(H) is a two-sided +—ideal in B(H) and ||-||2 is a norm on it 
which fulfils 


|Blle = |B 


for all B € J2. Any Hilbert-Schmidt operator is compact, J2(H) Cc K(H). 
Proof: If C € Jz, the relation ||(B+C)e;\|? < || Be;||? + 2||Be;|| ||Ce;|| + \|Ce,||? in 
combination with the Holder inequality yields ||B + Clo < ||Bll2 + ||Cll2. We have 
|oBll2 = |a|||Bll2, and ||B\l2 = 0 implies Be; = 0,7 = 1,2,..., 80 B=0 by 
continuity; thus (Ja, || - lz) is a normed space. For any unit x € H we can find an 
orthonormal basis {f;} such that r= f,, then ||Bz|| < (, || B fell?) = ||Bllo. 
The inequalities ||DBe;|| < ||D|| ||Be;|| yield DB € J) for any D € B(H). 
We know already that B € Jo implies B* € J ; it follows that D*B* € Jo and 
BD = (D*B*)* € Ja. Finally, let E, be the projection to {e1,...,en}in; then 
B, := EnB is a finite-dimensional operator and 


oo 
|(B - B,)al|? = x | \(e;, Ba)? < Ix? So [B*e,|?. 


j=n+1 j=n+1 


Since B* € J,, we get Bz — B,so B is compact by Theorem 3.5.5. 
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3.6.3 Remark: Since any B € J is compact, there is an orthonormal basis formed 
by the eigenvectors of |B| ; using it we obtain the following useful expression, 


ee 1/2 
|Bll2 = (= Hot . 


=1 


This provides us with a criterion for a compact operator to belong to J (Prob- 
lem 38) showing, in particular, that (HH) C K(H) is a proper inclusion and Je 
is not complete with respect to the norm || - || (see also Theorem 6.4.8). It is clear 
from the definition that the norm ||-||2 satisfies the parallelogram law, and therefore 
it is generated by the scalar product 


(B,O)2 = S> (Bes, Ce,) = Y (ey, BYCes) ; (3.12) 


j=1 


we can readily check that the series converges absolutely and its sum is independent 
of the orthonormal basis used. 


3.6.4 Proposition: .J(H) equipped with the inner product (-,-)2 is a Hilbert 
space. 

Proof: Consider a sequence {B,} C J which is Cauchy with respect to || - |lo. 
In view of Theorems 2 and 3.5.2, it is Cauchy with respect to the operator norm 
too and converges to some B € K(H). Choose € > 0 and an orthonormal basis 
{e;} C H, then 


N 
Se. (Ba - Bm)e;\l? Sal Ba = Brall3 < é 
jG 
holds for all m,n large enough and any N = 1,2,.... Performing first the limit 


n— oo, then N — 00, we get BE Jo and ||B— Brille < © for a sufficiently large 
m, 2.e., limmo ||B — Bmlle = 0. 


Next we are going to derive a functional realization of Hilbert-Schmidt operators 
on aspace L?(M,du) with a o-finite measure p such that L?(M,dy) is separable 
(cf. Problem 5). 


3.6.5 Theorem: An operator B € B(L?(M,dy)) is Hilbert-Schmidt iff there is 
kp € L?(M x M,d(u® p)) such that 


(Bf\(z) = f ko(e,y) £) duly) 
for all f € L?(M,dy). In that case, 


181g =f, Wee(a,y)Pd(u © u)(x,0). 


84 CHAPTER 3. BOUNDED OPERATORS 


Proof: Choose an orthonormal basis {gn} C L?(M,du) =: L*; then the vectors 
Rinn : Amn(£,Y) = 9m(t)gn(y) form an orthonormal basis in L*(M x M,d(u@y)) =: 
12. Given B € Jo(L*), we define bmn := (9m,Bgn)- Since ||Bllf = Xm lbmnl? < 
oo, the relation 


kp = Sebnalinn 


defines an element of L2,, and this in turn corresponds to an integral operator of 
Problem 5 which we denote as Kg. Using the Fubini theorem, we get 


(Gms Kagn) = ff Pranl@y) ke (esy) d(w® n)(2, 9) 


(hme ka)e = bmn = (9m, Bgn) ; 


and since the two operators are bounded, one has B = Kg. On the other hand, 
using Problem 5 we associate an operator B, := K with any k € L2,. One more 


application of the Fubini theorem gives 


lg = D0 (hn; kel? 


2 


I 


S| fi, dm l@) ( J, Rew) anv) deta) ) dpe 


> (Begall? = (Bell. 


This finishes the proof, and shows that the Hilbert spaces Jo(L°(.\f.dy)) and L2, 
are isomorphic by themap Brerkg. §f 


Another important class of compact operators is obtained if we generalize the 
notion of the trace known from linear algebra. Consider first a positive operator 
A € B(H) and set 

TrA := } 7 (e;,Ae;), 
j 
where {e;} is an orthonormal basis in H. The sum (either finite or infinite) makes 
sense and is independent of the choice of the basis; it follows from Proposition 1 
applied to B= V/A. 

An operator B € B(H) is said to be of the trace class if Tr|B| < oo ; the set 

of all such operators on a given # is denoted as Ji\(H) or J, . 


3.6.6 Proposition: (a) Any trace-class operator B is Hilbert-Schmidt, and there- 
fore compact, Ji(H) C Jo(H) C K(H), and 


2 oe) 
Tr/B| = |] ViBI[, = Lo ea). TW1Bl > [Bile > HBT. 
j=l 
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(b) An operator B € B(H) belongs to the trace class iff it is a product of two 
Hilbert-Schmidt operators. 


Proof: Part (a) is checked easily. It implies ,/|B| € J for any B € J; hence 


by polar decomposition, B is the product of W,/|B| and ,/|B|. Conversely, as- 
sume C,D € J. Using the polar decomposition once more, CD = W|CD\, in 
combination with the Schwarz and Hélder inequalities, we get 


Tr|CD| = 7 |(e;, W*CDe;)| 
j 


1/2 
< (x |C*We;||? 0 ay = ||D\l2|C"Wlz < 00 
j k 


because C*'WE FR. Of 


3.6.7 Theorem: 7,(H) is a two-sided *-ideal in B(H). It is a complete space 
with respect to the norm ||- ||; := Tr|-]. 

Proof: The algebraic property follows from part (b) of the previous proposition. 
Using part (a) together with (3.9) we find wa+c(J) < ua(y) + uc(J) , which implies 
Tr|B+C| < Tr|B| + Tr|C| ; the other properties of the norm are easily verified. 
To prove completeness, we can use continuity of the singular values with respect to 
the operator norm (Problem 36), and repeat the argument from Proposition 4. 


Now we are able to extend the notion of trace to nonpositive operators, thereby 
justifying the name given to the set 71. 


3.6.8 Theorem: (a) Let B be a trace-class operator, then 


TrBic >. (ey Bey 


ge! 


is well-defined in the sense that the series on the right side converges absolutely 
and its sum is independent of the choice of the orthonormal basis {e;}. 


(b) The map B + TrB is a bounded linear functional on (A, || - ||1) with the 
unit norm. Moreover, the relations 


Te* =r, Tr(6C) eal (@a 
hold for any BE 4,, C € B(H). 


Proof: Consider the orthonormal bases {e;} and {f;} where the former is arbitrary 
and the latter is formed by the eigenvectors of |B|. First we check the absolute 
convergence, 


yy \(e;, Be;)| = a ode, |( Berrie) fese;)| a = aa \(e;, B fe) (fees €;)| . 
j j_k sje 
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Rearranging the last series, we get the estimate 


1/2 
Yo enBey| <= > (x (ef ~ lento = » |B fell = Tr [Bl 
j k j H 


which shows that the series on the left side converges, and at the same time the 
double series ;, (ej, Bfk)(fe,€;) converges absolutely, so its sum is independent 
of the order of summation; it follows that 


Be) => Ven Bh ipe= XY (reperBip=—>_ fereye), 
j jk k jj i 


which proves part (a). 
The linearity of Tr(-) is obvious. Using the polar decomposition B = Wa|B], 
we obtain |Tr B| < ©; wa(y)|(f;,Wef;)| < ©; way), which shows that 


[Tr B| < Tr|Bl ; 


On the other hand, Tr £ = 1 for any one-dimensional projection. The first one 
of the remaining relations follows directly from the definition. If U is a unitary 
operator, we have Tr (BU) = ; (e;, BUe;) = ©; (Ue;, UBUe;) = Tr(UB) , and 
since any bounded operator can be expressed as a linear combination of four unitary 
operators (Problem 22), the theorem is proved. § 


Let us finally mention a particular family of trace-class operators which will be 
important in quaantum—mechanical applications: W € J,(H) is called a statistical 
operator if it is positive and fulfils the normalization condition TrW = 1. Its 
eigenvalues therefore coincide with its singular values, w; = uw(j), with a posi- 
ble repetition according to their multiplicity, and correspond to a complete set of 
eigenvectors, We; = w;e;. The normalization condition then reads 


yo ie 
3 


It follows trivially that 0 < w; < 1,7 = 1,2,..., and moreover, w; = 1 can hold 
only if the other eigenvalues are zero, i.e., if W is a one-dimensional projection; 
it is further equivalent to the condition TrW? = TrW. There is an alternative 
geometrical formulation. 


3.6.9 Proposition: The set W of all statistical operators on a given H is convex. 
An operator W € W is one-dimensional projection iff it is an extremal point of 
W, we., if the condition W = aW,+(1—a)W2 with 0<a<1 and W,,W,.eW 
implies W, = W2=W. 

Proof: If W? # W, then W has an eigenvalue \ € (0,1) corresponding to a 
one-dimensional projection E. We can write W = XE + (1 — A)W’, where W! := 
(1—)~!(W — AE) is a statistical operator, so W is not an extremal point. On the 
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other hand, suppose that W? = W and W = aW,+(1—a)We2 for some a € (0,1). 
We have 


TrW? = o? TW? + 20(1—a) TW,W2 + (1-0)? TrW2 = 1, 


and since TrW,W2 < 1 for any statistical operators Wi,W.2 (cf. Problem 40), 
this is possible only if TrW? = TrW? = 1. Then W, is a projection to a one- 
dimensional subspace spanned by a unit vector e; and the condition TrW,W2 = 1 
implies |(e1,€2)| = 1, so the two vectors differ at most by a phase factor. 


Notes to Chapter 3 


Section 3.1 The adjoint operator can be defined even without the inner—product struc- 
ture. Let +, Y be Banach spaces, then the dual (or Banach adjoint) operator to a given 
B € B(¥,y) is defined by (B*g)(x) := g( Bz) for all x € XY, g € Y*. By definition, 
B* is linear and ||B*|| < ||Bl|, so it belongs to B(Y*, V*); using Corollary 1.5.8 we can 
check that ||/B*|| = ||B\|. Notice that the map B+ B* is linear, in distinction to the 
Hilbert-space adjoint discussed in this section. The latter belongs to B(H), however, 
rather than to B(H*), and the correspondence between the two adjoints is in this case 
given by the antilinear isometry between H and H*. 

The other notions discussed here also admit generalizations. The integral representa- 
tion of Proposition 5 extends to any bounded mapping from L1(X,du) with a o-finite pu 
to L'(Y,dv), where Y is a metric space and v a Borel measure on Y — cf. [DS 1], 
Sec.VI.8. The matrix representation can be used for the operators B € B(¥,)), where 
«x, are Banach spaces having the so-called Schauder basis — cf. [ Tay], Sec.4.51. Fi- 
nally, strong and weak operator topologies can be defined on B(%,Y) for any Banach 
X,Y; we have to replace p, of the above definition by pz : pr(B) = ||Ba|ly and psy by 
Pte ? Pf2(B) =|f(Bx)| with x ¢ 4X, f € Y*. The corresponding topological spaces are 
discussed in [DS 1], Sec.VI.1. 


Section 3.2 Basic properties of the numerical range can be derived easily (Problem 27); 
they extend mostly to unbounded operators — cf. [Sto], Sec.IV.3. Using the numerical 
range, we can define some classes of operators. For instance, B € B(H) is antihermitean 
if O(B) is a subset of the imaginary axis; this is equivalent to the condition B* = —B. 
Other definitions of this type will be mentioned in the notes to Sec.4.1. Projections can 
generally be associated with any decomposition H = L+ L’, not necessarily orthogonal; 
the projections introduced in this section are then called orthogonal projections. Since we 
shall use almost exclusively the latter, we drop the adjective. 


Section 3.8 An antilinear isometry U : H — @ is called an antiunitary operator. 
Examples of such operators are, for instance, the complex conjugation on L?(M, du) or 
the map H* — H given by the Riesz lemma. As in Problem 20, we can check that 
U:H—-G is antiunitary iff it is surjective and (Uz,Uy)g = (y,x)y for any z,yeH. 


Section 3.5 The definition of compactness extends to Banach space operators C € 
B(4#,y). The present proofs show that the set of compact operators is closed again and 
the sufficient condition of Theorem 1 remains valid. Compactness is also preserved when 
passing to the dual operator; this is known as the Schauder theorem — cf. [Ka], Sec.III.4; 
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[ Yo], Sec.X.5. If the space is reflexive, Theorem 1 holds in both directions. The Riesz— 
Schauder theorem extends to operators from B(¥’) — cf. [ Yo], Sec X.5. Corollary 10 
also follows from the minimax principle: since B is normal, |,;| are eigenvalues of |B], 
so one has to apply Theorem 14.2.1 to the operator B*B. 


Section 3.6 An operator C € B(X,Y), where ¥, Y are arbitrary Banach spaces, be- 
longs to the trace class if there are bounded sequences {fn} C **, {yn} C Y and a 
sequence of complex numbers fulfilling 3°; |en| < oo such that Cr = 7, Cn fn(z)yn for 
any « € X. Such an operator is approximated by finite-dimensional operators and is 
therefore compact. In a similar way, we define a trace-class map from a locally convex 
space to a Banach space; using this A. Grothendieck introduced the important notion of 
nuclear space — cf. [ Yo], App. to Chap.X. 

The definition of the classes J, and J with the help of singular values suggests 
a natural generalization: we define the class J, for any real p > 1 as consisting of 
the compact operators C whose singular values fulfil {uc(j)} € &. Since the set of 
nonzero singular values of a compact C in a nonseparable #1 is at most countable, the 
definition makes sense in any Hilbert space. Investigation of the classes 7, was initiated 
by J.W. Calkin in the forties — cf. [Ca 1]. The recent state of knowledge in this field 
including numerous applications is summarized in the monograph [Si 3]. The reader can 
easily check that any J, is again a two-sided *-ideal in B(H), which is complete w.r.t. 
the norm 


1/p 
ICllp := (= poor) | 


These norms fulfil ||Bl| < ||Blla < ||Bllp for any g => p > 1 to which the inclusions 
K(H) D> Jg D> Jp correspond. Compact operators are associated with £° sequences of 
singular values; this justifies the alternative notation K(H) =: Joo(H). The factorization 
assertion of Propositon 3.6.6 generalizes as follows (see [ We], Sec.7.1): to any p,q > 1 
fulfilling pq >p+q set r:= pq/(p+q), then B € J, iff there are C € Jp, DE J such 
that B=CD. 

Since J2 has a Hilbert-space structure, any bounded linear functional f on it is of 
the form f = Tr(Cy-) for some Cy € Je, and |lf|| = ||Cy\l2. Similarly we can prove 
that Jj is linearly isometric to B(H) (Problem 42), while 7, is isometric to Jg with 
q:= p/(p—1), and J} to Joo := (K(H), || - ||). For more details see [Si 3], Sec.3. 


Problems 


1. Multiplication of bounded operators (defined as a composite mapping) is jointly 
continuous with respect to the operator norm: B, — B and C, ~ C imply 
BaCn - BC. 


2. Let D be a proper subspace in H. Then any Bo € B(D,H) has just one extension 
B € B(H) such that B} D+ =0; this operator satisfies ||_B|| = || Boll . 


3. Let L,L’ be dense subspaces in . The operators B,C € B(H) are equal provided 
(x, By) = (x,Cy) for all (x,y) € L x L', or (x, Br) = (2,Cx) for all rE L. 
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a 


go 


10. 
11. 


12. 


19: 
ive 


15. 


16. 


We have BM c BM for any B € B(H) and M C4; the inclusion turns into 
identity if B is invertible. 


Consider 1 = L?(M,du) with a o-finite ». To any ke L?(M x M,d(w@y)) we 
define the Hilbert-Schmidt integral operator by (Kf)(x) := fy k(x, y) f(y) duly) - 
It satisfies |||? < ayy ay |KI? d(u @ w) , and the adjoint operator to K is given by 


the formula (K*f)(x) = fag K(y, ©) f(y) duty) . 


The operator Pz := (y,:)x is bounded for any x,y € H, ||Pryl| = [loll llyi]. It 
satisfies Ps.—= Powand PryPuy = (y;t) Pro - 


The shift operators of Example 3.1.3 have the following properties: 


(a) |[S™|| = ||(S*)"|| = 1 and (S*)"S" = J. Furthermore, S"(S*)” is a partial 
isometry from H to {e1,... en}+, . 


(b) s-limpoo(S*)* = 0, while {S”} does not converge strongly. 


Using the shift operators of the previous problem, define Q, := S"(S*)*. The 
sequence {Qn} of projections converges strongly to zero, but it does not converge 
with respect to the operator norm. 


Let dim # = oo. Using Problem 7 show that 


(a) the map B+ B* is continuous with respect to the operator norm and weak 
operator topologies while it is not continuous with respect to the strong ope- 
rator topology, 


(b) By, — B and C, + C does not imply B,C, ~ BC. 
The topologies tT ,7,, and 7, on B(H) coincide for dimH < oo. 


A bounded operator B on a separable H is Hermitean iff its matrix representation 
with respect to any basis fulfils Bj, = Bey, j,k =1,2,.... 


The product of Hermitean operators A, B is Hermitean iff they commute, AB = 
BA. The product of commuting positive operators is positive. 


Let \ be an eigenvalue of a positive A; then VX is an eigenvalue of VA. 


We have {|Bz|| = |||Blz|| for any x € H and ||Bl| = |||B) ||. The relation |B| = B 
holds iff B is positive. 


Find bounded operators B,C such that 
(a) |BC| # |BIIC], 


(b) |B*| # BI, 
(c) |B+C| < |B] + |C| is not valid. 


Hint: In the last case use B = 03+], C =0;—I, where the o; are Pauli matrices. 


If L isa subspace in H and E is the projection to L, then ||Ex|| = sup{ |(x,y)| : 
y € L, |\y|| =1} holds for any x eH. 
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Prove Proposition 3.2.9. 
Let E,F be arbitrary projections on #. Prove: 


(a) Ran EN Ker F = {0} implies dimE < dim F, 
(b) RanE Ker F # {0} implies ||E — F|| =1, 


(c) ||E—F|| = max{ogr, ore}, where we have denoted cgr := sup{ ||(J— F)z|| : 
x € EH, ||2|| = 1} =sup{d(z, EH) : 2 € EH, |\x|| =1}. 


Hint: To prove (a) for a non-separable # use the following assertion: if J is an 
index set of infinite cardinality m and {M,: a€J} with card M, < m for each 
a, then card User Ma Sm. 


Suppose that a sequence {F,,} of projections, not necessarily monotonic, converges 
weakly to a projection LE, then FE =8-limpookn. 


Prove: (a) If the map U : H — # is surjective and preserves the inner product, 
then it is linear. 


(b) A unitary U which satisfies U? = I is Hermitean; on the other hand, any 
U € B(H) which is simultaneously unitary and Hermitean satisfies U? = I. 


(c) If U is unitary, UL+ =(UL)+ holds for any subspace L C H. 


The set U(H) of all unitary operators on a given # is a group with respect to 
operator multiplication. If 4 = L?(UR), the set {Ue: yp €®} of Example 3.3.2 is 
a subgroup in U/(L?(JR)). Which topology makes U(H) a topological group? 


Any Hermitean operator is a linear combination of two unitary operators. 
Hint: If ||Al| =1, consider A+iVI — A?. 


The Fourier—Plancherel operator F on L?(JR™) satisfies F? = R, where R is the 
reflection operator, and F4 =[. 
Hint: Show that Ff = F-'Rf for fe L?nL'. 


The polar decomposition of Theorem 3.3.5 cannot be generally replaced by B = 
|B|W. For a Hermitean A, however, one has W* = W and W|A| =|A|W. 
Hint: Try a shift operator, and for the second part use Proposition 3.2.6. 


Let V,,7r = 1,2, be isometric operators with the domains D, = E,H. If D, 1 Do 
and RanV, 1 RanV2, then W := V,£,+V2E2 is a partial isometry with the initial 
subspace D; © D2 and final subspace Ran Vi @ Ran V2. 


Given a sequence s := {A;} € €*, one can construct an operator B, with a pure 
point spectrum, o(B,) = 3, and ||B,|| = sup,|Aj|, on any infinite-dimensional 
Hilbert space #. 

Let B € B(H). Its numerical range satisfies 


(a) ©(B) is a convex set which is generally neither open nor closed, 
(b) the spectrum of B is contained in O(B), 
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28. 


29. 


30. 


31. 


32. 
3S. 


34. 


35. 


56. 


37. 


(c) if B is not normal, O(B) need not equal be to the convex hull of o(B). 


Hint: To prove (b), use Theorem 4.7.1; for part (c) consider B = ( : ee ) and 
ial NG 
f malt a) 


If a sequence {Bn} C N(H) converges strongly to B € N(H) ; then B* -, B* 
(compare to Problem 9). 


Prove: (a) Suppose 0 € o(B)\o,(B) for an operator B € N(H) ; then Ran B is 
not closed. Find examples of such operators with (i) a pure point spectrum, 
(ii) a purely continuous spectrum. 


(b) The spectral radius of an operator B € M(H) equals its norm, r(B) = ||B||. 
Hint: (b) Use the relation r(B) = r(B*) and (3.5). 


A bounded operator B is m—dimensional iff there are linearly independent sets 
{fi,---, fm}, {91,---,9m} C H such that B = O%, (fj,-)g;. In that case, we 
also have dimRan B* =m. 


K(H) is closed with respect to neither the strong nor the weak operator topology 
unless dimH < oo. 


Let B, > B in B(H) and C € K(H); then ||B,C — BC|| — 0. 


(Ker C)+ is separable for any compact C. 
Hint: Let {e.} be an orthonormal basis in (KerC)+; then {e,: ||Ceg|| > n—!} 
must be finite for any n. 


A normal compact operator B has at least one non-zero eigenvalue unless B= 0. 
Hint: Consider first a Hermitean B; in the general case use the commutativity of 
Re B, ImB. 


Prove: (a) C € B(H) is compact iff Ce; — 0 holds for any orthogonal set 
{es} c (KerC)*, 

(b) C € B(H) is compact iff |C| is compact, 

(c) a positive operator A is compact iff VA is compact, 

(d) if B>A>0 and B is compact, then A is compact. 


The singular values of a compact operator C satisfy 


(a) uc(7) = ucr(J), 
(b) pac(i) < ||Biluc(7) and ca) < ||Blluc(7) for any B € B(H), 
(c) |uc(Z) — uv(j)| < uo-p(J) < ||C — DI). 


Set A := sup{ ||Cz|| : x € 51}, where C’ is a compact operator and Sj is the unit 
sphere in a closed subspace G C H. Then there is a unit vector y € G such that 
A = ||Cyll. 

Hint: Use the weak compactness of bounded sets in #. 
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38. 


39. 


40. 


41, 


42. 
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Prove: (a) A compact operator B is Hilbert-Schmidt or of the trace class iff 
the corresponding sequence of singular values {j8(j)} belongs to ror, 
respectively. 

(b) The set of finite-dimensional operators is dense in the Hilbert space J. 


(c) The same set is dense in (Jj, || - |l1)- 


Prove: (a) The operator Pry of Problem 6 is of the trace class for any 2,y € H 
and Tr Pry = (y,x). If {e;}92, is an orthonormal basis in 1, then { Pee, : 
j,k =1,2,...} is an orthonormal basis in the Hilbert space J2(H) , which is 
therefore separable. 

(b) Let x++ @ be an antilinear isomorphism of the spaces and H*, and define 
Z®@y:= Pry; then one has Jo(H) = H* @H. Extend this result to a pair of 
Hilbert spaces #1, Ho. 


Let Be B(H) and C € J,(H) ; then 


(a) Tr|C| = Tr|C*|, 

(b) Tr|BC| < ||Bl| Tr|C], 

(c) due to the previous assertion, the maps fc := Tr(-C) and gg:=Tr(B-) are 
bounded linear functionals on B(H) and J\(H), respectively. Their norms 
are ||fcl| = Tr|C| and ||gal| = ||Bll. 


If C is a trace class operator and a sequence {Bn} converges weakly to some B, 
then Tr(B,C) - Tr(BC). 

Hint: Use boundedness of {||Bn||} to check that >>; |(e;.(Bn — B)Ce;)| converges 
uniformly with respect to n. 


The spaces 7;, and B(H) are linearly isometric: for any bounded linear functional 
g on the Banach space (Ji, ||; ||1), there is a bounded By such that g = Tr(B,:). 
Hint: Use the fact that { Pry: z,y € H} is total in J; — see Problem 39. 


Chapter 4 


Unbounded operators 


4.1 The adjoint 


A linear operator T on H is said to be densely defined if its domain is dense, 
D(T) = H ; we denote the set of all such operators as C(H). Any bounded 
operator is densely defined. However, Example 3 below shows that L(H) contains 
unbounded operators as well; in such a case there is no standard way to extend T 
to the whole space space H. 

The main importance of the fact that an operator is densely defined is that it 
allows us to generalize the concept of the adjoint. If T € C£(H) then to any ye H 
there is obviously at most one vector y* such that (y,Tx) = (y*,x) holds for all 
x € D(T) ; the difference from the bounded case is that for some y the vector y* 
may not exist. Using this we define the adjoint operator T* of T by T*y := y* 
with the domain D(T*) consisting of those y for which y* exist. Let us list first 
its elementary properties. 


4.1.1 Proposition: (a) The operator T* is linear and satisfies (y, Tx) = (T*y, x) 
for any x € D(T) and y € D(T"), 


(b) KerT* = (RanT)*, in particular, KerT* is a closed subspace, 
(c) SDT implieseS* CT. 


Proof: Assertions (a),(c) are evident. If y € KerT*, then (a) gives (y, Tx) =0 for 
all x € D(T), so y € (RanT)* ; this condition in turn implies y € D(T*) and 
fy=0- af 


The sum or product of operators T,S € L(H) may not be densely defined, and 
therefore Theorem 3.1.2 is replaced by the following collection of weaker statements, 
the proof of which is left to the reader (Problem 2). 


4.1.2 Proposition: (a) (aT)* = aT". 
(b) T** >T provided T* € L(H). 
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(c) If T is invertible and T~! € L(H), then T* is also invertible and its inverse 
(T+)! = (TY. 


(d) Let S+T be densely defined; then (S + T)* D S* +7” and the inclusion 
turns to identity if at least one of the operators T,S is bounded. 


(e) If T'S is densely defined then (TS)* D S*T* ; the two operators are equal if 
T € B(H). 


The concept of an adjoint allows us to define several important operator classes. 
A densely defined operator A is called symmetric if A C A* ; this is equivalent 
to the condition (y, Av) = (Ay,x) for all x,y € Da. If A € L(H) is equal 
to its adjoint, A = A’, it is called self-adjoint. The corresponding subsets in 
L(H) are denoted as £,(H) and L4a(H) , respectively. Every self-adjoint operator 
is symmetric, La(H) C L,(H) ; for bounded operators the notions symmetric, 
self-adjoint, and Hermitean coincide. 


4.1.3 Example (the operator Q on L?(IR)): We put (Qw)(zr) := xy(x) on the 
domain D(Q) := {~ € L?(R): Jpx?ly(x)|\?dx < 00}. It is densely defined since 
D(Q) contains, e.g., the Schwartz space S(R) , and is symmetric because (¢,Qy) = 
(Q¢,w) holds by definition for all ¢,~ € D(Q). It is also easy to see that Q is 
unbounded: we have ||Q@nl|? = (2n + 1)(2n + 2) for unit vectors dn: @n(x) = 
Qn)))aerte Fl? ne? ee 

We shall show that D(Q*) = D(Q). To any y € D(Q*) there is w* € L? 
such that fp $(x) [xew(x)—-y*(x)| dx = 0 holds for each ¢ € D(Q). Since any L? 
function with a bounded support belongs to D(Q) we can replace ¢ by ¢yn in the 
above condition, where x, is the characteristic function of the interval (—n,n). 
Using the fact that D(Q) is dense in L?(JR) we find xy(r) = w*(x) for aa. 
xz € (—-n,n),n=1,2,...,1¢e, e¥(r) = ¥*(@) ae. in IR. We have y* € L? by 
assumption, so ~ belongs to D(Q) and the operator Q is self-adjoint. 


Let A be asymmetric operator. Any symmetric A’ such that A Cc A’ is called 
a symmetric extension of the operator A. A symmetric operator is mazimal if it 
has no proper symmetric extensions, i.e., if the relation AC A’ fora symmetric A’ 
implies A= A’. By Proposition 1, we have Ac A’ Cc A’ Cc A*, so any self-adjoint 
operator is maximal. On the other hand, there are maximal symmetric operators 
which are not self-adjoint (cf. Example 4.2.5 below). 


4.1.4 Example: Let € := {e;}92, be an orthonormal basis in a separable H. To 
any sequence s := {s;}2, of complex numbers we can define the linear operator T, 
with the domain D(T;) = Etin by Tre; = $;€;,j = 1,2,.... It obviously belongs 
to L(H) ; it is bounded iff the sequence s is bounded. 

Now we want to find the adjoint T?. Let y := Yja1 1;€; belong to D(T*) so 
there is a vector z := 92, ¢je; such that (z,x) = (y,Tsx) holds for all x € Ein. 
Choosing in particular z = e;, 7 = 1,2,..., we get 3jn; = G and the Parseval 
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identity gives D(T*) C D,, where we have denoted 


Dw {2= E607 : Ele? <= | 
p= wah 


the parallelogram identity together with the inclusion € C D, implies that it is 
a dense subspace in H. On the other hand, we have (y,T;2) = D9, 77;8;€; for 
any y € D, and x € €ji,. The sum 3°; 7;3;e; is convergent by assumption and 
determines a vector y, € H, so the last relation can be written as (y, 7,2) = (ys, 2) 
for all x € Ejin ; this means that y € D,. In this way, we have found the operator 
T; : it has the domain D(T*) = D, and acts on any y := 5 me; € D, as 
Tey = jl 15336; - 

Since D, # Erin, the operator T, is not self-adjoint; it is symmetric iff {s;} C 
IR. It is also obvious from the above argument that T, extends naturally to the 
operator T; defined by Tsx := 52, €jsje; for any x := Yo; €;e; € D. =: D(T,). 
We see that T* = Ts where 5 := {3;}. Repeating the above argument we find 
T; =T* so T* = Ts; this means that T, is self-adjoint iff {s;} is a real sequence. 
Other properties of the operators T, can be found in Problem 6. 


The operator T, in the previous example is defined on the whole 1 only if it 
is bounded. Similarly the operator @ of Example 3 is not defined on the whole 
L* (IR) ; for instance, the vector =: w(x) = (1+27)-*/? does not belong to D(Q). 
This appears to be a common property of all unbounded symmetric operators (see 
also Problem 7). 


4.1.5 Theorem (Hellinger—Toeplitz): Let A be a symmetric operator on H with 
Da =H; then A is bounded. 

Proof: A symmetric operator defined on the whole # is necessarily self-adjoint 
and therefore closed (see Example 4.2.1b below); the assertion then follows from the 
closed-graph theorem. § 


4.2 Closed operators 


Now we are going to discuss how the general results on closed operators dealt with 
in Sections 1.6 and 1.7 can be specified for the set C(H) of closed operators on a 
given Hilbert space. Let us start with some simple examples. 


4,2.1 Examples: (a) Suppose 7' is densely defined and {y,} C D(T") is a se- 
quence such that y, — y and T*y, — z. Since (Y,Tx) = (T*yn,x) holds 
for any x € Dr, the limit n — oo yields (y, Tx) = (z,x),s0 y € D(T*) and 
z=T*y. This means that the adjoint 7™ is closed for any T € L(H). 


(b) Consider further a symmetric operator A. Since A®* is closed, A is closable, 


Ac Ac At. Let x,y € D(A), so there is a sequence {r,} C Da such 
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that 2, — 2, Atn — Az, and an analogous sequence for the vector y. The 
relations (yn,Atn) = (Ayn, Zn) then yield (y,Ar) = (Ay,zx) ; hence any 
symmetric operator has a symmetric closure. In particular, any self-adjoint 
operator is closed, A = A* = A. 


Passing to the closure is a standard way of extending an operator. Of course, it 
is by far less universal than the continuous extension of bounded operators. First 
of all, the closure may not exist for a densely defined operator, as Problem 1.61 
illustrates. Moreover, even if it exists its domain is in general still different from 
H. For instance, the closure A of an unbounded symmetric operator A is again 
unbounded and symmetric, so D(A) #'H by Theorem 4.1.5. 

Nevertheless, the closure represents a unique extension for any closable oper- 
ator. This motivates the following definition: a symmetric operator A is called 
essentially self-adjoint (we use the abbreviation e.s.a. ) if A is self-adjoint. 
Since any self-adjoint extension A’ of a symmetric A is closed, we have A C A’ ; 
if A is e.s.a. we get A’ = A. In other words, an essentially self-adjoint operator 
has a unique self-adjoint extension, namely its closure (see also Problem 8). 

The converse is also true: a symmetric operator which has just one self-adjoint 
extension is e.s.a. (see Section 4.7 below). If an operator T is closed, any subspace 
Dc D(T) such that T} D =T is called its core . In particular, D is a core for a 
self-adjoint A iff A} D is e.s.a.; we usually say briefly that A is e.s.a. on D. 


4.2.2 Example (the operators T; revisited): The relation T, = TZ implies that T, 
is closed for any sequence s. The operator T, , from which we started, fulfils Bac 
T, , and therefore T, C T;. On the other hand, any x € D, can be approximated 
by the vectors tp := Djui(e;,x)e; ; then Tr, = DojpetSG(q@)e; — Tye, whieh 
means that x belongs to the domain of T,. Together we find that Ein is a core 
for T,; in particular, that T, is e.s.a. if s is a real sequence. 


Properties of the closure can be investigated using the notion of the operator 
graph introduced in Section 1.6. We define the unitary operator U on H@®H by 
U|x,y] := [—y, 2] ; then the graph of T* can be expressed as [(T*) = (UI'(T))+ 
(Problem 9). Since the orthogonal complement is closed, we again arrive at the 
conclusion of Example la. 


4.2.3 Theorem: Let T € C(H) ; then T exists iff T* is densely defined. If this 
condition is valid, we have T** = T and (T)* =T". 

Proof: If T* € L(H) then T** exists, and T** D> T, and since T** is closed T 
is closable. On the contrary, let T exist, so '(T) = I(T). Using the operator U 
introduced above, we have U*T =I and U(I)+ = (UT)+ for any subspace [ C 
H@®H, and therefore [(T) = F(T) = UP(T) = (U?T(T))++ = (U(UT(T))*)+. 
Substituting ['(7*) for (UI'(T))*+ we get [(T) = (UP(T*))+, and since ['(T) is 
a graph, T* is densely defined and ['(T) = I'(T**) by Problem 9. The second 
relation follows from the first, and from the closedness of the operator T , because 
(T)* = T*** — (Tar = T* == J", | 
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The subset of all closed operators in L(H), i.e., the intersection L(H) NC(H) 
is denoted as £,(#1). Theorem 3 implies the following simple relations between the 
spectral properties of T and T*. 


4.2.4 Corollary: Let T € £,(H) ; then 
(a) A€o(T) iff AXE o(T"), 
(b) ((T — y)~!)* = (T* — fi)“ holds for any regular value y of T. 


Proof: If w € p(T) then Proposition 4.1.2c implies @ € p(T*) and the identity 
of part (b). This also means that \ € o(T*) implies . € o(T). By Theorem 3, 
T* € L(H) and T** = T, so the argument can be repeated with T replaced by T* 
thus giving assertion (a). § 


4.2.5 Example (the operator P): Let J C R be an open interval with the 
endpoints a,b, —co < a < b < o© ; we want to construct a closed symmetric 
operator corresponding to the formal differential expression —i 4. The problem 
clearly consists of finding a suitable domain. It can contain only those 7 € L? 
for which the derivative y~’ is finite a.e. and square integrable in J. The first 
requirement is fulfilled if ~ is supposed to be absolutely continuous in J (see 
Remark A.3.9). We denote 


AG(s) = {y e L?(J) : w absolutely continuous in J, ~' € L*(J) } ; 


this subspace is dense in L?(J) containing, e.g., CS°(J) (cf. Example 1.5.3c). If 
@,w belong to AC(J), an easy integration by parts yields 


(¢, —iy’) = —i[¢, | ae (—i¢', v) ’ 


where we have denoted [¢,~] := limzo- ¢(x)p(x) — lima; O(x)p(z). An anal- 
ogous argument shows that a “ can correspond to a symmetric operator only if 
Rea = 0; with later applications of this operator in quantum mechanics in mind 
we choose a = —1. 

Introducing the operator P : Py := —i’ with the domain D(P) = AC(J), 
we can rewrite the above relation as (¢, Pw) = —i[¢,#] + (P¢, Y) ; this shows that 
P is not symmetric unless J = IR (Problem 12a); it is also clear that a densely 
defined operator P Cc P is symmetric if [¢,%] = 0 holds for all ¢,e of its domain. 
We shall show that P can be chosen in such a way that P* = P. 

Let D be the set of all % € AC(J) with a compact support contained in 
J. The operator P := P) D is densely defined, because Co°(J) C_D. We have 
[¢,%] = 0 for all ¢ € ACU), YE D, and therefore (¢, P=) = (Pd,w), which 
means that P c P*. To prove the opposite inclusion, one has to check that if 
(¢, Pb) = (n,) holds for given $,n € L?(J) and any ~ € D, then 6 € AC(J). 
We take a compact interval K := {a, 6] C J and define on it ia) = fF n(t) dt+c, 
where the number c will be specified a little later. The function nj is absolutely 
continuous in K and (n%)' = xxn holds in the L* sense. Now let wx be an 
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arbitrary function from D with the support contained in K ; then integrating the 
right side of (¢, Pox) = (n,WK) by parts, we get (ni +id, Wy) = 0. Consider next 
the function nj, which is defined by x(x) := Ja(nk + 1o)(t)dt for x € K and 
equals zero outside ts interval. We now choose the number a in such a way that 

ni (3) = 0; then nX belongs to D and we may set wx = n}¢ in which case the 
ae (nt. + id, We) =0 yields nk = —ixxod. Hence ¢ is absolutely continuous 
on K and ¢'(x) = in(x) for aa. x € K. Since K was an arbitrary compact 
interval in J, we finally arrive at the desired result, ¢ € AC os. 

Thus the operator P defined above satisfies Pr=P; ;_it implies in particular 
that o _is closed. In view of Theorem 3, the closure P := P has the same adjoint, 
P* = P, and P* = P. Now we want to show how the domain of P can be 
characterized by means of boundary conditions. 

We have to distinguish the cases when J isa finite interval (a,b), or J = (0,0) 
as a representative of semifinite intervals, or finally J = JR. In view of Problem 12, 
any ~ € AC(J) has one-sided limits at the endpoints of the interval; we denote 
them as w(a),~(b). Let us define 


-| {p € AC(a,b) : oa) = Y(b) = 0} 
D: {wh € AC(0,00) : (0) = 0} 
ACU) 


for the three cases, respectively. Theorem 3 readily implies that the operator P 
is unique in the following sense: if Pp is a closed symmetric _— such that 
P5 = P,then Pp = P. Using this fact we shall check that D(P) = 

We ast Py = Pp D. By Problem 12, this operator is symmetric + PAD P; 
on the other hand, P C Pp implies fi 5h P, soit remains for us to prove that Pp is 
closed. This is obvious for J = IR when Pp = P is self-adjoint. In the remaining 
two cases it is sufficient to check the inclusion D(Pj*) C D. Let ¢ € D(P5*), 

e., (¢, P5v) = (Phd, ) for all » € D(P%) = AC(J). Since P= P and the 
inclusion Pp C P together with the closedness of P imply Ps Cc P we can rewrite 
this condition as (¢, Pw) = (P¢,~) ; from this we see that (g,~| = 0 must hold for 
all » € AC(J). The functions ~ may assume any values at the finite endpoints of 
J ; hence $(a) = $(b) = 0 must be valid for the finite interval and ¢(0) = 0 for 
J=(0,00),i¢, GED. 

Concluding the above argument, we have shown that to any interval J C IR the 
formal expression —1 pa determines just one unbounded closed symmetric operator 
P c P which satisfies the condition P* = P ; its domain is DCP). D:. 

In the case of a finite or semi-infinite J the operator P is not self-adjoint, 
so we have to ask whether it has nontrivial symmetric extensions. If J = (0,00) 
the answer is negative: the domain of such an extension should contain a function 
wy € AC(J) with (0) 4 0 ; however, then [,~] 4 0, so the operator cannot be 
symmetric. The operator P on L?(0,00) thus represents an example of a maximal 
symmetric operator which is not self-adjoint. 

On the other hand, in the case of a finite J = (a,b) the condition [¥,y] = 
|(b)|? — |~(a)/? = 0 can be fulfilled provided ~(b) = 6(a) where || = 1. More- 
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over, if we define De := {w € AC(J) : H(b) = Ov(a)} then [¢,%] = 0 holds 
for any ¢,y~ € De. Hence there is a bijective correspondence between symmetric 
extensions of the operator P on L?(a,b) and complex numbers with |6| = 1 ; each 
of these operators Ps := P)} Dg is self-adjoint. 


In Section 1.7 we described how the spectrum of a closed operator T can be 
classified. If T is a Hilbert-space operator it is useful to introduce another subset 
of o(T). It is called the essential spectrum and denoted as dess(7') ; it consists 
of all A € € to which there is a sequence of unit vectors 2, € Dy which has no 
convergent subsequence and satisfies (T—A)an —- 0. 


4.2.6 Proposition: The spectrum of an operator T € C(H) decomposes as o(T) = 
Cnr Wer) Wier. (Pf) . Moreover, oT) —Gess(TP) \ (on(P)UOr(T)) . 


Proof: If  € p(T) the resolvent Rr(A) is bounded and ||(T—A)z|| > ||Rr(A) ||“! Iz] 
holds for any x € Dr ; from here it follows that A # oess(T), and therefore 
Oess(1) C o(T). Now let 2 be an arbitrary point of the spectrum; if there is 
c, > 0, such that ||(T—A)z]| > cy||z|| for any x € Dr, then Ran(T—)) is a closed 
subspace by Problem 1.64 and the condition A € o(T) implies Ran (T'—A) 4 H, 
i.e., A € o,(T). In the opposite case inf{ ||(T—A)za|| : x € Dr, |lx|] = 1} =0, 
so there is a sequence of unit vectors z, € Dr such that (T—A)z, — 0. If some 
subsequence fulfils z,, — x, then ||z|| = 1 and Tzn, — Az, and since the operator 
T is closed, we get x € Dr and Tx = Ax, i.e., A € op(T). On the other hand, if 
there is no convergent subsequence, A € dess(T') by definition. This proves the first 
decomposition; combining it with the fact that the decomposition (1.6) is disjoint 
we get the remaining assertion. Jj 


In distinction to (1.6), the decomposition of o(T) derived here is in general 
not disjoint. For instance, any eigenvalue of T of an infinite multiplicity belongs 
tO ess(T') because the corresponding normalized eigenvectors form an orthonormal 
set from which it is impossible to select a convergent subsequence. Notice also that 
the last mentioned condition can be replaced in the definition of oess(T) by the 
requirement that the set {rn : n = 1,2,...} is noncompact. The nonexistence of 
a convergent subsequence easily implies noncompactness; on the other hand, if the 
set is noncompact it contains an infinite subset {z,, : k = 1,2,...} without an 
accumulation point; hence the sequence {Zn,} has no convergent subsequence. Let 
us finally mention a remarkable property of densely defined closed operators. 


4.2.7 Theorem: If T € £.(H), then T*T is a positive self-adjoint operator and 
D(T*T) is a core for T ; similarly TT* is positive self-adjoint and D(TT*) is a 
core for T*. 

Proof: We have (x,T*Tx) = ||Tzx||? > 0 for any x € D(T*T), so T*T is positive. 
The relation T** = T in combination with Problem 9 gives [(T)®UT (T*) = HOH, 
so to any x € H, there are vectors u € D(T) and v € D(T*) such that [z,0] = 
(u, Tu] +[-T*v, v] . It follows that u€ D(T*T) and x = (I+T*T)u. The vector x 
is arbitrary, which means that the range of the operator S:= 1+T*T is the whole 
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H. Since T*T is positive, S is invertible; using the fact that (y,Sx) = (Sy,z) 
holds for all x,y € D(T*T) we find that S~! is symmetric. In addition, it is 
bounded by Theorem 4.1.5, because D(S~') = Ran S = H ; thus S is self-adjoint 
by Problem 3 and the same is true for T*T = S—I. 

Next we denote Tp := T | D(T*T) ; then the second assertion is valid iff I(T) = 
I(Tp) , and in view of the inclusion Tp C T and the closedness of T , this is further 
equivalent to the requirement that the orthogonal complement of ['(Tp) in I(T) 
consists of the zero vector only. Suppose that ({z,Tz],[y, Ty]) = 0 holds for some 
xz € D(T) and all y € D(T’T). It follows that 0 = (z,(1+T*T)y) = (x, Sy), and 
since RanS = H, we have x = 0. Finally, T € C.(H) implies T* € £.(H) and 
T** = T, so the assertion remains valid after the interchange of T and T7*. §& 


4.3 Normal operators. Self—adjointness 


The notion of a normal operator introduced in Section 3.4 also extends to unbouded 
operators; we say that an operator T on H is normal if it is densely defined, 
closed, and satisfies T*T = TT*. The set of all normal operators on a given # will 
be denoted as Lr(H). The roles of T and T* in the definition can be switched, 
so if T is normal the same is true for T* and vice versa. Any self-adjoint operator 
is obviously normal, i.e., Lea(H) C Ln(H) C L.(H). The definition includes the 
requirement of equality of the domains, D(T*T) = D(TT*), which is not always 
easy to check. For this reason, the following criterion is useful. 


4.3.1 Theorem: A densely defined closed operator T is norma! iff D(T) = D(T*) 
and ||T'x|| = ||T*z|| holds for all x € D(T). 

Proof: Suppose that T*T = TT*; then the common domain D := D(T*T) = 
D(TT*) is by Theorem 4.2.7 a core for both T and T", and since |/Tx|| = ||T*z|| 
holds for all « € D, the same is true on D(T) = D(T*) (Problem llc). Con- 
versely, let ||Tx|| = |[T*z|| hold for any x € D := D(T) = D(T*), so (Tx, Ty) = 
(T*x,T*y) for all x,y € D by the polarization formula. If z € D(T*T) ,i.e., x € D 
and Tz € D, then (Tx,Ty) = (T*Tz,y) so we have (T*Tz,y) = (T*x,T*y) for 
any y € D = D(T*). Hence T*x € D(T**) = D(T) = D and TT*z = T'Trx ; in 
other words, T*T C TT*. Finally, interchanging the roles of T and T*, we find 
that T is normal. § 


4.3.2 Remark: In fact, the closedness of T need not be assumed (cf. Problem 11b). 
Any normal operator T is maximal in the sense that it has no nontrivial normal 
extension: if S € £,(H) fulfils S$ > T, then D(S) = D(S*) c D(T*) = D(Tf). 


4.3.3 Example (the operators T;): Let (X,A,u) be a measure space with a 
o-finite measure. Given a measurable function f : X > C, we set 


Dy = {we L(X,dy) + f IfPlwldu < 00} 


and define Ty with the domain D(T;) := Dy by (Tyw)(x) := f(x)b(z) ; we call 
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it the operator of multiplication by f. Such operators play an important role in 
spectral theory as we shall see in Section 5.3. Let us list their basic properties, 
which follow from the general results derived up to now: 


(a) Ty is densely defined. To check this, we use the nondecreasing sequence of 

sets MJ = {7 € X: |f(x)| << n},n=1,2,..., with USL, Mi =X. Let 

n be the characteristic function of MJ ; ee ee any vector w € L?(X,dy) 

ee function Xn~ belongs to Dy and or w|| - 0 by the dominated— 
convergence theorem, i.e., D; = L?(X,dy). 


(b) Ty; = 7, iff f(x) = 9(x) for u-a.a. x € X. The sufficient condition is obvious. 
To check the necessary condition, we use the sets N, := Mf M8. Their 
characteristic functions again belong to D; = D,, and since U, N, = X, it 
is enough to show that f(x) = g(x) for p-a.a. x € N, ; this follows easily 
from the relation ||(T; — T,)xwn, || = 0. 


(c) Assertion (a) implies existence of T? . Mimicking the argument of Example 4.1.3 
we check that 77 = T; ; the two operators have the same domain D; and 
Zy|| = [TF for any Y € Dy , so 


(i) T; is normal, 
(ii) Ty is self-adjoint iff f(x) EIR ae. in X. 


(d) If f € L°(X,du), we have Dy = L?(X,dy) and ||T/bl? = fx fPldPdp < 
FZ Hell? for any w € L?, so T; is bounded and ||T+|| < || f||2, . To show that 
the converse is also true we have to check that for a given T; € B(L?(X,dy)), 
the set N := {x € X:: |f(x)| > ||T;||} is u-zero. Let R € A be any 
set with p(R) < oo, then ||TyxRnanl|l? < |!T;|/?u( RAN) ; in other words, 
Sx(\fP-lITell? xenndu < 0. But |f(x)| > ||Ty|] on N, so the inequality can 
hold only if u( RMN) =0; since yp is o—finite we arrive at the desired result. 
We see that the condition f € L® is necessary and sufficient for T; to be 
bounded; if it is valid we have ||T;|| = || flo. 


Let us now turn to spectral properties of normal operators. In the same way as 
in Section 3.4, we can check that \ € o)(T) iff \ € op(T*) and the corresponding 
eigenspaces coincide, 

Ker (T — A) = Ker(T* —)). (4.1) 


The eigenspaces corresponding to different eigenvalues, A # w, are mutually or- 
thogonal, Ker (T—A) 1 Ker(T—y). Furthermore, Proposition 4.1.1b implies that 
the residual spectrum is empty, o-(T) = 0, and A € o,(T) iff Ran(T—-A) 4H. 
Theorem 3.4.3 also extends to the unbounded case. 


4.3.4 Theorem: The resolvent set of a normal operator T coincides with its 
regularity domain, i.e., \ € p(T) iff there is c(A) such that ||(T—A)z|| > c(A)||z|| 
holds for all x € D(T) ; this is further equivalent to Ran(T—A) =H. 
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Proof: The argument of Theorem 3.4.3 can be followed up to the point where we 
have to check that Ran (T—A) is closed. Here we must use the fact that T is closed 
(see Problem 1.64); due to this Ran(T—A) =# implies (T—A)"' € B(H). 


4.3.5 Corollary: (a) Let T € £,(H); then the following conditions are equivalent: 


(i) A€o(T), 
(ii) inf{ |(T—A)zl| : x € D(T), |x|] =1} =0, 
(iii) there is a sequence of unit vectors x, € D(T) such that (T—A)z, > 0, 


(iv) Ran(T—\) £H. 


(b) The spectrum of a self-adjoint operator A lies on the real axis. In particular, 
if A is below bounded then inf o(A) > inf{ (xz, Az): x € D(A), ||x|| = 1} 
(in fact, an identity is valid — cf. Proposition 5.4.1a below). 


4.3.6 Example (spectrum of the resolvent for T € L,(H)): Let pw be a regular 
value of a normal operator T. Since Rr() is normal (Problem 18a) we can use 
the above results to determine its spectrum. First of all, 0 € p(Rr(u)) iff H= 
Ran(Rr(u)) = D(T), and this is in turn equivalent to the boundedness of T ; 
hence the spectrum of Rr() contains zero iff T is unbounded. Further, let \ 4 0 
and y € Ran(Rr(u)—A), we, y = (Rr(u)—A)x for some x € H. The vector 
z:= —ARr(u)x belongs to Dr and satisfies the identity y= (T—p—A7")z, which 
means that y € Ran(T—p—A7!). The same argument in the reverse order then 
yields the identity Ran(Rr(m) — 4) = Ran(T—y—A7'), and by Corollary 5a, 
A€o(Rr(u)) if w+A! € o(T). This gives the sought expression for o(Rr(1)) 
of a normal operator T’, 


{A: A=(v— yp), v E o(T)} ... T bounded 
o(Rr(u)) = 
{A: A=(v—yp)“, ve o(T)} U {0} ... T unbounded 


Using further the fact that the spectral radius of a bounded normal operator equals 
its norm (see Problem 3.29) we get 


(Rr(a)l = up| = ve ot) }. (42) 


|v 
In the same way we can find the spectrum of the operator TRr() , which is again 
normal and bounded; we have o(TRr(u)) = {A: A=v(v—p)"}, v € o(T)}, in 
particular |7Rr(4)|| = sup,eo¢r) vn)". 


4.3.7 Example (the operators T; revisited): Now we want to find the spectrum 
of the multiplication operators T; of Example 3. Given a measurable function 
f:X > Cwe put RYW(f) = {r’(€ ©: w(f-((A-e,A+e))) £0 for anye >0}; 
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this set is called the essential range of the function f (with respect to ys; the index 
p. will be omitted when it is clear from the context). We shall prove that 


o(T;) = Ress(f). 


Let  € Ress(f). We denote M. := fi-)((A —¢,A +6)) ; since p is o-finite we 
may assume p(M,) < oo without loss of generality. The function f is bounded on 
Mz, so We := u(M.)~'xm, belongs to D;. It has a unit norm and ||(T;—A)#e||? = 
(u(M.))~* fy, |f-A|?du < €? , and since € was arbitrary,  € o(Ty) by Corollary 5a. 
On the other hand, if A ¢ Ress(f) then there is ¢ > 0 such that p(M.) = 0; hence 
we have fy |b|?du = Sxyy, \l?du for any y € L*(X,du) , and consequently 


y—Awl? =f, WO-APWOPaue) > ef WwPdu= lvl? 


for all % € Dy, i.e., 4 ¢ o(T;) by Theorem 4. In particular, if yu is the Lebesgue 
measure on JR” and f : IR > C is continuous, then o(T;) = RanT;. 


In Section 3.4 we introduced the notion of pure point spectrum for B € B(H). 
When the operators in question are unbounded, we have to pay attention to their 
domains; for instance, the operators T, and T, from Example 4.1.4 have the same 
orthonormal basis of eigenvectors but, of course, T; # T;. In the general case, 
we therefore introduce the notion for normal operators only: we say that an ope- 
rator T € L,(H) has a pure point spectrum if there is an orthonormal basis € := 
{€ataer © D(T) consisting of eigenvectors of T, i.e., Teg = Aa€a - 


4.3.8 Proposition: If T is a normal operator with a pure point spectrum, then 
oy e571) aideD(T) ={ eH: Soe |(eaye)|? Nal? <eco } . 

Proof: The assertion about the spectrum is proved in the same way as in Theo- 
rem 3.4.7. Denote the right side of the latter relation as De. If x € D(T) then 
Tx? = Xa l(ex, Tr)? = La |(T*ea, 2)? = La lAcl*|(€a,2)|?, so D(T) C De. In 
view of Proposition 2.1.6a and the Hélder inequality, De is a subspace in H ; the 
relation Tr := Ya Aa(€a;L)€a defines on it an operator which satisfies Poe, 
Mimicking the argument of Example 4.1.4 we can check that D(T*) = De and 
T=). \GO8, DC o, "80 \|Tx|| = ||T*x|| holds for all Ps € De. Hence T is 
normal, and therefore T = T by Remark 2, i.e., D(T) = | 


Let us next mention an important application of Theorem 4. 


4.3.9 Theorem (self-adjoitness criterion): Let A be a symmetric operator; then 
the following conditions are equivalent: 


(a) A is self-adjoint, 
(b) A is closed and Ker (A* +1) = {0}, 
(c) Ran(A+i)=H 
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the numbers +i can be replaced by the pair z,z for any z€C'\ R. 

Proof: A self-adjoint operator A is closed and satisfies Ker (A* +7) = {0} because 
At +i = A¥i and o,(A) C ofA) C R ; thus (a) implies (b). If (b) is valid, 
then Ran(A+i) = H by Proposition 4.1.1b and ||(A + 7)x|| > ||z|| holds for 
any x € D(A) (Problem 4); in combination with the closedness of A this implies 
that Ran(A + i) is closed (cf. Problem 1.64). Finally, let (c) be valid, and denote 
z= (A*+i)y for any y € D(A*). Since Ran(A+i) = H and AC A’, there 
is a vector yo € D(A) such that z = (A*+i)yo. Then y—yo € Ker(A* +7) = 
(Ran (A — i))+ = {0} so y= y; this implies D(A) = D(A*). Ei 


4.3.10 Corollary (essential self-adjointness criterion): Let A be symmetric; then 
the following conditions are equivalent: 


(a) A is e€.s.a. , 
(b) Ker(A* +2) = {0}, 
(c) Ran(A +2) is dense in H; 


the numbers +i can again be replaced by any nonreal z,Z. 
Proof is left to the reader (Problem 20). 


4.3.11 Example (operator Q revisited): Denote Qo := Q)C§°UR). In view of 
(1.2), to any w € L?(UR) and < > 0 there is ¢ € C§(R) such that ||l~—@|| < 
e. The functions nz : ni (x) = o(x)(x +7)! then also belong to C%°(IR), so 
ll — (Qo + i)nx|| = ||Y—@l| < €, te., Ran(Qo +i) = L?(IR). Hence the operator 
Q is e.s.a. on C§°(IR) ; this is also true for other operators (Problem 17). 


It is often difficult to check the self-adjointness of a given operator using either 
the definition or the above criterion. One way to counter this problem is based on 
a perturbative approach: we have a self-adjoint operator A and look for a class 
of symmetric S such that A+ S is self-adjoint. A simple example is given in 
Problem 3d; however, self—adjointness is stable with respect to a much wider class 
of symmetric perturbations. Let us mention one frequently used theorem at this 
point; other results of this type will be discussed later. 

Let A, S be linear operators on H ; we say that S is relatively bounded with 
respect to A (or briefly, A-bounded) if D(S) > D(A) and there are non-negative 
a,b such that the relation 


|Szl| < aff Axl] + bljz|| (4.3) 


holds for all z € D(A). The infimum of all a > 0 for which there is b > 0 such 
that the above inequality is valid is called the relative bound of S with respect to 
A, or the A-bound of the operator S ; notice that the inequality may not hold if we 
substitute for a the A-bound itself (Problem 21). It is useful sometimes to work 
with squared norms. It is clear that the condition 


Sa? < a? || Axl? + 4?l|x\? 
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with a=a and {= b implies (4.3), and this in turn implies the above inequality 
with a? = (1+e)a? and 6? = (1+ .e7!)b? for any € > 0, so the infimum of all 
a > 0 for which it is valid with some (@ coincides with the A—-bound of S. 


4.3.12 Theorem (Kato-Rellich): Suppose that A is self-adjoint, and S is sym- 
metric and A~bounded with the A-bound less than one; then 


(a) A+S is self-adjoint, 
(b) if D Cc D(A) is acore for A then A+S is also e.s.a. on D. 


Proof: By assumption, A+S is symmetric, D(A+S) = D(A), and the “quadratic” 
inequality holds for some a < 1. In view of Problem 3d, we may assume a # 0 ; 
we set 7 := G/a and rewrite the condition in the form ||Sz||? < a?||(A + iy)z||? 
(cf. Problem 4). The numbers +77 are thus regular values of the operator A, so 
D(A) = Ra(tiy)H. Substituting 2 = R4(+iy)y in the condition, we get for any 
y € H the relation ||SR4(tiy)y|| < allyl]. Hence the operators Bs := SRa(+iy) 
are bounded with ||B.s|| < a <1, and the By +/J are regular by Lemma 1.7.2. 
This means that for any z € H there are yz € H such that z = (Bi + I)yn, 
and to them in turn we can find rz € D(A) such that (A F 7y)xz = ys. Then 
z= (SRa(tiy) + I)(AF iy) rs = (S+AFiy)vz, ie, Ran(S+AFiy) =H and 
A+S is self-adjoint due to Theorem 9. 

Now let D be a core for A ; then for any x € D(A) there is a sequence 
{tn} C D such that x, — x and Az, — Az. Due to the assumption, we get 
(A + S)(an—z)|/? < 2IA(en —2x)|? + 2||S(en -2)|/?_< 2(1 + o*)|| A(z, —2)||? + 
26? ||z,—z\|?, and since D(A+S) = D(A) we obtain (A+S))}DDA+S. On the 
other hand, (A+S))/}DCA+S and A+S is closed as we have already proven, 
so the opposite inclusion is valid too. & 


4.4 Reducibility. Unitary equivalence 


Let T bea linear operator on H with the domain Dr. If G isaclosed T-invariant 
subspace, then T}/G is an operator on G ; we call it the part of T in G. Suppose 
further that subspaces G, and Go := Gj are both T-invariant, and denote by 7; 
the part of T in G;, 7 = 1,2, then we define the operator T; 7 with the domain 
D(T, © Tz) = (Dr NG) ® (Dr NGe) by (Ti @ T2)(x1 + 22) = Tyx, + Torq for 
z; € Dr NG; ; we call it the orthogonal sum of T; and T,. We obviously have 
T, ®T, =T if Dr =H, while in the general case only the inclusion T; @ Tz c T 
is valid unless the projections E; to G; map the domain Dr to itself. 


4.4.1 Example: Consider the operator P from Example 4.2.5 on L?(—1,1) and let 
E be the operator of multiplication by x(o,1). The subspaces Ran E' and (Ran £)+ 
are obviously P-invariant, but Ew belongs to Dp for ~ € Dp only if (0) =0. 


This motivates the following definition: a linear operator T on H is reducible 
if there is a nontrivial projection & such that (i) EDy C Dr, and (ii) the subspaces 
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EH and (I—E)H are T-invariant; in the opposite case T is irreducible. If T 
satisfies the above conditions, we also say that it is reduced by the projection E or by 
the subspace EH. It is clear that T is reduced by EH iff it is reduced by (BH). 
Moreover, in the notation used above T = T; ®T2 iff the subspace G, =: L,H 
reduces T ; in that case D(T;) := Dr G; = E;Dr, where Ey := I—-E,. This 
results easily extends to any finite number of orthogonal subspaces; for an infinite 
orthogonal sum an extra condition is needed (Problem 23). 


4.4.2 Theorem (reducibility criterion): A linear operator T is reduced by a pro- 
jection E iff ET Cc TE;; this is further equivalent to the conditions ED; C Dr 
and ETx =TErz for all x € Dr. 

Proof: Let T be reduced by E. We take any x € Dr ; by assumption the vectors 
Ex, (I—E)z belong to Dr and TEx € EH, T(I—E)z € (EH)+, so ETx = 
ETEx+ET(I-E)x = ETEx = TEx. On the other hand, the condition ET c TE 
implies EDr Cc Dy and Tx = TEx = ETx € EH for any x € EHN Dr ; in the 
same way we check T-invariance of the subspace (J-E)H. & 


4.4.3 Example (reduction of a normal operator): Let » be an eigenvalue of a 
normal operator T. Since T is closed the corresponding eigenspace Ker (T—A) is 
also closed, and furthermore, Ker(T—.) C Dr implies that the projection E(,) 
to the eigenspace satisfies E(A)Dr C Dr. The subspace Ker (T—.) is obviously 
T-invariant, and since (Ty,z) = (y,T*x) = X(y,x) = 0 is valid for any y € 
(Ker (T—A))+N Dp and x € Ker (T—), the orthogonal complement (Ker (T—A))+ 
is also T-invariant. Hence a normal operator is reduced by any of its eigenspaces. 

If uw, are different eigenvalues of T', the corresponding eigenspaces are orthog- 
onal, E(u)E(A) = 0. If o,(T) := {A1,.--,An} is finite, then EB, := F_, E(A;) 
is the projection to @j., Ker (T—A,;), and each of the projections E(A;) reduces 
T ; thesame'is true*for B,. Further, t-og(7) = {dj : 7 = 1,2)...) Become 
ably infinite and denote by EF the projection to Hp := @j2, Ker (T—Aj;), ie., 
E = Vy B(A;) := s-limpioEn. Since FE, = ExE = E, for any n, we have 
Yn := E,Ex = E,x € Dr for each x € Dr and y, — y. At the same time, 
E,, reduces T, so Ty, = TE,2 = E,Tx — ETrx ; due to the closedness of T 
we get Ex € Dr, i.e., EDr C Dr and TEx = ETx. Hence the subspace H, 
reduces the operator 7’, and its parts T, := TH, and T, := T)H> are normal 
operators on Hilbert spaces H, and H+, respectively (Problem 24b). Notice that 
the assumption that o,(T) is countable was used just for convenience; in the same 
way we can check using Proposition 3.2.13 that H, := DNeo,(T) N(A) reduces the 
operator T for o,(T) of any cardinality. 

It is clear from the construction that H, contains an orthonormal basis consist- 
ing of eigenvectors of T ; on the other hand, T, has no eigenvalues. Hence we have 
found that any normal operator decomposes into the orthogonal sum 


T=T,®T., o(T) = o(Tp)Vo(T.) = o,(T)Uo(Te), 


where J, is a normal operator with a pure point spectrum while o(Tc) = @ ; the 
decomposition of the spectrum follows from Problem 24b and Proposition 4.3.8. The 
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formulas hold, in particular, for any self-adjoint A, in which case the operators A, 
and A, are also self-adjoint. 


If T is bounded, the condition of Theorem 2 means that the operators E, T 
commute, ET = TE. Motivated by this we extend the notion of commutativity in 
the following way: suppose that T' is any linear operator on H and B € B(H) ; then 
we say that T and B commuteif BT Cc TB, or more explicitly, if BDr C Dr and 
BTx =TBz holds for any x € Dr. According to this definition, for instance, any 
bounded invertible B commutes with its inverse while, of course, BB-! ~ B-1B 
if RanB #H. Commutativity can be checked using the resolvent of T’. 


4.4.4 Theorem: A closed linear operator T with a nonempty resolvent set com- 
mutes with a bounded B iff Rr(u)B = BRr(p) for at least one w € p(T) ; in that 
case the relation holds for all  € p(T). 

Proof: 'To check the sufficient condition, notice that any x € Dr can be written 
as c= Rr(u)y; then Br = BRyr(w)y = Rr(u)By so Ba € RanRr(p) = Dr. 
Moreover, BTx = B(T—p+p)Rr(w)y = By + wBa and TBzr = (T-p)BRe(p)yt 
pBr = By+pBz,so BTx = TBe for all x € Dr. Conversely, the inclusion 
BT C TB means that B(T—p) C (T-yw)B for any p € C, which in turn gives 
Rr(u)B = BRr(u) provided p € p(T) (Problem 25a). & 


4.4.5 Corollary: Let T,S be closed operators with nonempty resolvent sets. If 
there are Ao € p(T) and po € p(S) such that Rr(Ao) commutes with Rg(p9) , 
then Rr(A) and Rs(u) commute for all A € p(T), w € p(S). 


This property can serve as an alternative definition of commutativity for self-adjoint 
operators (cf. Problem 5.23b); it is essentially the only situation when we deal with 
commuting operators, which are both of unbounded. 

Linear operators T on H and S on G are said to be unitarily equivalent 
if there is an unitary operator U : G — H such that T = USU™!. This condition 
means, in particular, that Dr = UDs and RanT =U RanS. The unitary opera- 
tors in B(H) form a group with respect to multiplication, so the unitary equivalence 
between operators on a given 7 is reflexive, symmetric, and transitive — hence the 
name — and we can decompose a family of operators into equivalence classes. Uni- 
tary equivalence preserves many operator properties; we shall list a few of them (see 
also Problem 27). 


4.4.6 Proposition: Let T,S be unitarily equivalent, T= USU~'!; then 


(a) if S is densely defined, the same holds true for T and T* = US*U~'. In 
particular, if S is symmetric or self-adjoint, then T is also symmetric or 
self-adjoint, respectively, 


(b) if S is invertible, so is T and T~’=US™'U-', 


(c) if S is closed the same is true for T ; we have o(T) = o(S) and I(T) = 
Ue (S), where Ug[z,y] := (Uz, Uy]. 
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Proof: In view of Problem 3.4, Ds = G implies UDs =H. Using Proposition 4.1.2e 
and Problem 2c, we can easily check T* = US*U~!, which means, in particular, 
that D(T*) = UD(S*) ; from here we get the remaining part of assertion (a). Next 
we denote T’ := US-!U-!. We have D(T') = URanS = RanT and T'Txr=2 
for all z € Dr so T’ = T~!;; this proves (b). The operator Ug: G@G —- HOH is 
unitary and UI'(S) = I(T) holds by definition. If S € C(H) we have UgI'(S) = 
Ugl(S) = Usl(S), so T € C(H). The operators T—A and S—A are unitarily 
equivalent for any 4 € C; if A € p(S) then Ran(T—A) = U Ran(S—A) =H and 
Ker (T—A) = U Ker (S—A) = {0}, so A € p(T). Interchanging the roles of T,S we 
get p(T) = p(S), and therefore o(T)=o(S). & 


4.4.7 Example: We shall show that the operators P and Q on L?(JR) are unitar- 
ily equivalent. The Fourier—Plancherel operator F is unitary by definition; hence 
T := F7'!QF is self-adjoint and its domain consists of the vectors ~ = F-'¢ 
with @ € D(Q). Since D(Q) c L'UR) (Problem 5), we can write the identity 
(Qr)/*p(x) = ifp ors 1 yo(y) dy + In b(y) dy. It follows from Proposition 3.1.5 
and Example 3.1.6 that w is absolutely continuous in JR and 


d _ 

~iy(@) = ny? Sf =) Qeyqyay = (FQ¢)(2) 
xe 

hence 7 € L?(R) and —iy’ = F-!QFy, which we may write as T C P. Finally, 

both operators are self-adjoint and therefore equal each other, 1.e., 


P= FOF. (4.4) 


This unitary equivalence allows us to study the properties of P using the simpler 
operator Q. For instance, we immediately get o( P) = 0(Q) = R. 


4.5 ‘Tensor products 


Now we want to show how one can construct to given operators JT, on Hilbert 
spaces H,, r = 1,...,m, an operator on the tensor product H; @-:-@H,. For 
simplicity we shall assume a pair of Hilbert spaces; the extension to any finite n is 
straightforward. 

First, let us have a pair of bounded operators B, € B(H,). We define the ope- 
rator B,x Bo called the tensor product of B,, Bz by (Bix Bo)(x®y) := Byx@ Boy 
for any x € H,, y € He and extend it linearly to H,xHg ; the correctness of this 
definition can be checked using Problem 2.20. 


4.5.1 Lemma: The operator B;~= Bz is bounded with — < ||By|| || Bol]. 

Proof: Since we can write B,x By = B° ee, where fe = B,xI_ and Boy = 1,;xBy, 
it is sufficient to prove that ||/Beul| < ||B,|| |lull eile for any u € HyxHo. 
Let wu = \).,2; ® y; and choose orthonormal bases fe1,.-- Teme A,..., fa} 
in the subspaces spanned by {x1,...,2y} and {y1,...,yn}, respectively; then 
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U= Vins Arser@f, and | Bull? = |[Ds.9.@ fall’ , where we denote gs := 0, GrsBie, - 
However, the set {g,@ f,: s=1,...,m} is orthonormal, so ||B°ull? = , |lgell? < 
|B, ||?||u||? ; the relation for r = 2 is checked in the same way. § 


As a bounded densely defined operator, B,x By has a unique extension to H;@ 
Hz which we call the tensor product of the operators B,, By, and denote as 
B, ® By. We have ||B, ® Ba|| > [Bir @ Boy|| = ||Bizlli||Boyll2 for unit vectors 
xz € Hi, y € He, and the inequality remains valid if we take the supremum over 
Z,y ; in combination with Lemma 1 this gives 


|| Bi @ Boll = || Bill Ball. (4.5) 


The map [{B,, Bg] +> B; ® Bg defined in this way is obviously bilinear and has the 
following simple properties, the proof of which is left to the reader (Problem 30, see 
also Problems 31-34). 


4.5.2 Theorem: Let B,,C, € B(H,), r=1,2; then 
(a) B,C, @ BoC, = (B, 4) Be)(Cy @ C2) and (By ® Bz)* = BY © B3 ; 


(b) if the operators B, are invertible, the same is true for B, © By and the inverse 
(B; ® By)~" = By’ @ Bz", 


(c) if the operators B, are normal (unitary, Hermitean, projections), then B;@ By 
is respectively normal (unitary, Hermitean, a projection). 


Let us return to the general case of operators T, with the domains D(T,) Cc 
H,, r= 1,2. We define the operator T; ® Tz with the domain D(T,)xD(Th) by 


(T; ® T2) (= r;® s) = )/ 112; @ Tay; 
; 


J 


for any n-tuples {x;} C D(T,) and {y;} C D(T2) ; we call it the tensor product 
of T, and To. 


4.5.3 Remarks: (a) Strictly speaking, T; ® T2 corresponds rather to T;xT> ac- 
cording to the previous definition. The problem is that each definition is 
natural in its context. For bounded operators B,xB, played an auxiliary 
role only due to the existence of its unique extension; this is no longer true 
in the unbounded-operator case, where D(T,)x D(T2) represents the suitable 
“Smallest” domain for the tensor product. 


(b) The tensor product Hi ®Hp2 can have different realizations to which, of course, 
different realizations of T;@T2 correspond. Using Proposition 2.4.2, however, 
it is easy to check that any two of them are unitarily equivalent. 


4.5.4 Theorem: Let operators T,, r= 1,2, be densely defined; then 
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(a) T, @T2 is densely defined and (T; @ T2)* > Tf @T3, 
(b) if the operators T, are closable, so is T, ®T2 and T; @T; DT, @T2. 


Proof: Since D(T-) = H, , Proposition 2.4.4a implies that T,@T> is densely defined; 
the inclusion then follows from the appropriate definitions. Using Theorem 4.2.3 we 
find that T, @ T> is closable if T, are closable; furthermore, z, + z and Yn — y 
imply tp @ Yn + Z@ y, so Proposition 1.6.6 yields the remaining inclusion. 


4.5.5 Example: In general, the above inclusions are not identities even if one of 
the operators is bounded. To illustrate this, consider an unbounded self-adjoint 
operator A on H; and a one-dimensional projection E on a two-dimensional 
Hz, in which we choose an orthonormal basis {e, f} such that e € Ran E. Using 
Problem 2.21a we find D(A@ FE) = {r@e+y@f: z,y € Da} ; on the other 
hand, a simple argument shows that the adjoint has the domain D((A ® E)*) = 
{xr@e+y@f:xreE Da, y €H,} and acts onit as (A@E)*(x@e+y@f) = Are. 
Since A is unbounded, we have D(A ® E) # D((A®@ E)*). In particular, A@E 
is not self-adjoint, but it is e.s.a. because (A ® E)* is symmetric. 


Tensor products of unbounded operators again fulfil natural algebraic rules; 
however, we should be careful now about the domains. 


4.5.6 Proposition: Let T,,S, be linear operators on H,, r = 1,2; then 
(T, aia Sy) @T,=T, @T2+5,@T2 and (TS;) ® (T2S2) (a (Ti ® T2)(S) ® S2) : 


In particular, if S, is invertible and Ran S, Cc D(T,) for at least one r, then the 
last relation turns to identity. 


Proof: The inclusion (T,; + S,;) @ To C T, @ Tz + S; © Tz is obvious. Further, 
let x € (D(T;)xD(T2)) N (D(S1)x D(Ta)), te. © = Lhe 27 @ Yj = Leer T, OM 
with z; € D(T;), x, € D(S:) and y;,y, € D(T2). Let {e}, be an orthonormal 
besie-in {yy,... ,.Qt>---aYiyhe C D(T2), 0 yy => yperer and a = Dyrdwer; 
then the vectors % := Lj_,cyx; € D(T;) and # := Dea, dat, € D(Si) satisfy 
Li(Z — Z,) ® e: = 0. Since the e; are orthogonal, we get Z = %,1=1,...,N, 
i.e, X € D(T1)N D(S;). Finally e € D(T2), so x = 4) % @ a belongs to 
D(T; + S;) ® D(T2) , which proves the first relation. The remaining inclusion follows 
directly from the definition; we leave to the reader to check the sufficient conditions 
for it to turn to identity (Problem 35). § 


4.6 Quadratic forms 


The simple correspondence between bounded operators and bounded sesquilinear 
forms expressed by Proposition 3.1.1 poses the question whether one can extend it 
to the unbounded-operator case. We shall see that domain considerations make the 
problem much more complicated and an affirmative answer can be obtained only for 
certain classes of operators. 
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First we must extend the definitions of Section 1.1 to include unbounded forms 
which may not be everywhere defined. Let D be a subspace in H ; a sesquilinear 
form f: Dx D—C will be briefly referred to as a form on H with the domain 
D(f) := D. If D(f) =H the form f is said to be densely defined. In view of 
the polarization formula, there is a bijective correspondence between f and the 
quadratic form qs: qs(x) = f(x,x). It is thus reasonable to simplify the notation 
and use the same symbol f for both, setting f(x] := f(z,x) for any x € D(f). 

We can define naturally the operation of summation and scalar multiplication 
for forms, with the domains obeying the same rules as for summation of operators; 
restrictions and extensions also have the usual meaning. The form associated with 
the unit operator is sometimes denoted as e: e(z,y) = (x,y). The set O(f) := 
{ f[z] : x € D(f), ||x|| = 1} is called the numerical range of f ; in the same way 
as in the operator case we find that f is symmetric iff O(f) C IR. A symmetric 
form s is below bounded if m, := inf O(s) > —oo and positive if m, > 0. Notice 
that if s is below bounded there is always a “shifted” form which is positive, e.g., 
So := s—m,e. If s is positive, then the Schwarz inequality, 


Is(x,y)| < (s[x]s{y])”?, 


is valid for arbitrary vectors x,y € D(s) ; this implies, in particular, the relations 
s[xt+y]/? < s[x]/?+s[y]/? and |s[x]/?— s{y]'7| < s[x—y]¥/?. 


4.6.1 Examples: (a) Let T be an operator on H with the domain Dr. The form 
fr: fr(x,y) = (x,Ty) with the domain D(fr) := Dr is said to be generated 
by 7. Its numerical range clearly coincides with ©(T) ; in particular if T is 
symmetric and below bounded the same is true for fr. 


(b) A linear operator S: H — H, defines a positive form p with D(p) := Ds by 
p(x, y) = (Sz, Sy). 


(c) Let H = L?(a,b) with b—a < oo. Given real numbers c,,c, and a real- 
valued function V € L?(a,b) such that V(x) > m ae. for some m, we set 
D(t) := AC{a,b] and 


(6.0) = [ (HRW! a) + VOC W(e)) de + AFH) + cadTAV(a). 


The form t is densely defined and symmetric. The integral part is obviously 
below bounded; we can choose m := inf ess ze(a4)V (x). The other two parts 
are also below bounded. To check this, we express $(b) = (gn, $’) + (g},, ¢) 


integrating by parts with g,(z) := (#=2)", and ¢(a) similarly using the 
function h,(x) := —gn,(b+a—Zz) . It yields 


[4] > |I¢'\? + mlldll? — leo] (8)? — lea] |6(a)|? = All¢'ll? — BIlgll?, 


where A := 1 — (\cal + |col)|lgn||? and B := 2(|ca| + |csl)\lgn||?— m. Since 


l9n|] = (ems) ; , we can choose n so that A > 0; hence the form t is below 


bounded with m > —B (corresponding to the chosen rn). 
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Given a symmetric below bounded form s we can equip its domain D(s) with 
the inner product (x,y), := (x,y) + (1—ms)(x, y) ; this can be also expressed as 
(x,y), = (2, Y)eo = S0(z,y) +(z, y) , where So is the positive form introduced above. 
In this way D(s) becomes a pre-Hilbert space which we denote as H, ; if it is 
complete, i.e., a Hilbert space, the form s is called closed. 

A sequence {zn} C D(s) is Cauchy with respect to ||- ||, iff it converges to 
some x € H and for any € > 0 there is an n, such that |s[z,—2m]| < € holds 
for any n,m > n, ; for the sake of brevity, we shall use the symbol Zn “, 7 asa 
shorthand for these conditions. 


4.6.2 Proposition: The following conditions are equivalent: 

(a) s is a closed form, 

(b) s+ ae is closed for any aé R, 

(c) if a sequence {rn} C D(s) fulfils xz, “4 2, then z € D(s) and s{z—z,] — 0. 
Proof is left to the reader (Problem 38). 


4.6.3 Example: Let p be the form of Example 1b; then a sequence fulfils zy Eg 
iff In + x and {Sz,} is convergent in +, ; hence p is closed iff the operator 
S is closed. For instance, let H = H, = L*(a,b) and S:=—i~ with the domain 
D(S) := AC|a, }], i.e., S = P* in the notation of Example 4.2.5; then p: p(~,¢) = 
f° #(x)v' (x) dx is a closed form. 


4.6.4 Lemma: If 2, “> x and yn > y, then the sequence {s(2p,yn)} is Cauchy; 
in particular, s(n, yn) — s(x, y) if the form s is closed. 


Proof: The conditions 2, —> 2, y, > y imply (n.yn) — (x,y), so we can 
assume without loss of generality that s is positive. The assertion then follows 
from the estimate 


[s(@n, Yn) — S(2m,Ym)| < |$(2n —Zm, Yn)| + |s(tm,Yn—Ym)| 


< 8[2p_—2Lm]!/?5[yn]/? + 8[Yn— Ym] '/?s [en] /? 


combined with the fact that the sequences {s[zp]}, {s[yn]} are Cauchy by the 
Schwarz inequality and therefore bounded. If the form s is closed, then the condi- 
tions zp, — 2, Yn > Y imply s{z,—z] +0 and s[y,—y] — 0, so it is sufficient 
to put 7, =z and ym =y in the above estimate. [ff 


Not every symmetric form is closed, so we have to ask about existence of closed 
extensions. The situation is less transparent than in the operator case, where we 
could lean on the concept of a graph, and the results are generally weaker; we shall 
see below that there are symmetric below bounded forms with no closed extension. 

A form s is called closable if there is a closed form t > s. The following theorem 
shows that a closable form always has a smallest closed extension, which we call the 
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closure of s and denote as 3. If s is a closed form and D Cc D(s) is a subspace 
such that s\ D = it is called a core for s. 


4.6.5 Theorem: (a) A symmetric below bounded form is closable iff 2p ©, 0 
implies s[r,] 0. 


(b) Let s be closable and denote by D(3) the subspace consisting of those x € H 
for which there is a sequence {rn} C D(s) such that z, + x. Then the 


relation 
3(x,y) == lim s(2n, yn) 


with z, —> x and y, —» y defines on D(s) the form 3 which is the smallest 


closed extension of s, and moreover, mz = ™,. 


Proof: If t D> s is a closed form then z, a0 implies 2p, ee 0, and 0 = 
limp—oo t[Zn] = limps 8{%,] due to the closedness of t. Suppose on the contrary 
that x, <> 0 implies s{z,] — 0 ; we shall show that the relation 3(z,y) := 
limp—co $(Zn, Yn) defines on D(S) a closed form such that t D 3 holds for any closed 
extension t of s. Existence of the limit follows from Lemma 4; its independence of 
the chosen sequences follows from the estimate used in the proof of the lemma. In 
view of Problem 38b, D(S) is a subspace and 3 is a form which extends s ; it is 
furthermore clear from the definition relation that 5 is symmetric, below bounded, 
and mz > _™m,, which in combination with the inclusion 3D s gives mzy= mg. 

It remains to check that § is the smallest closed extension. Let x € D(S) and 
In —-» x, 80 to any € > 0 there is n, such that |s[z,—2m]| <¢ for n,m > ne. 
Passing to the limit m — oo we get |S|%n—az]| <¢ if n > n.; hence (i) D(s) 
is dense in the pre-Hilbert space Hs = D(S) and (ii) if a sequence {z,} C D(s) 
is Cauchy with respect to ||- ||,, then there is x € Hs such that ||z,—z||, > 0. 
This allows us to check that Hz is complete. Let a sequence {y,} C Hs be Cauchy 
with respect to || - ||; ; then due to (i) there is a sequence {z,} C D(s) such that 


\|2n—Ynllz < n=! for n =1,2,.... This implies that the sequence {zn} is Cauchy 
with respect to ||- ||s, and therefore also with respect to ||- ||, ; then ||/z,—z||s; — 0 
by (ii). Finally, |lyn—allz < |lzn—zlls + n-' — 0, so Hs is complete and 5 is a 


closed form. 


Finally, consider a closed form t D s and any z € D(8). By assumption, there 
is a sequence {z,} C D(s) such that rp ““, 2; then ty —> x and the closedness 
of t implies z € D(t). Using the definition of 3 we get 3[z] = lim, $[%,] = 


lim wedieg| —Tlelso ts. Ti 


4.6.6 Example: Consider again the form p of Example 1b. In view of Propo- 
sition 1.6.6, t, —» 0 implies plz] — 0 iff operator S is closable. Hence a 
nonclosable positive form corresponds to any operator which is not closable. For 
instance, choose p: p(¢,7%) = ¢(0)~(0) on L?UR) with D(p) := AC(UIR). Then 
dn —“+ 0 for a sequence {¢,} C D(p) means ¢, + 0 and ¢n(0) > a ; however, 
a need not be zero, as we can easily check. 
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Fortunately, this cannot happen with a form which is generated by a symmetric 
below bounded operator. 


4.6.7 Proposition: A form s which is generated by a symmetric below bounded 
operator A is closable, and its closure fulfils 3(z,y) = (x, Ay) for any x € D(3) 
and y € D4 = D(s). 

Proof: In view of Theorem 5, it is sufficient to check that z, > 0 implies s[z,] > 
0. Suppose first that A is positive, so s is also positive; then using the Schwarz 
inequality we find that {s[zn]} is Cauchy, and therefore bounded, s[x,| < M for 
all n. Another application of the Schwarz inequality gives 


al@,) = s(tn,2n—Xm) + 3(Gn,2m) 


IA 


8[atp]!/?5[atq—atm|'/? + (Ataytm) < (Me)? + || Aznll llomll 


for all n,m > n,-. Since ty —, 0 includes the condition In — 0, the limit m — oo 
leads to s[rp] < (Me)/? for all n> ne, i.e., s[z,] + 0. If inf (A) = mg < 0, 
then the form so := s — mye is positive, and by Proposition 2, rn =4 implies 
$0[Ln] > 0 so |s[xa]| < So[%n] + |ma| \|znll? + 0 ; this proves the existence of the 
closure. For any x € D(S), there is by Theorem 5 a sequence {zn} C Da such 
that 2, “> x; then 3(x,y) = limpco (In, y) = limpaoo(tn, Ay) = (x, Ay) holds 
forany y€ Da. OF 


The following theorem plays a key role in the theory of unbounded forms. 


4.6.8 Theorem (representation theorem): Let s be a densely defined form which 
is closed, symmetric, and below bounded; then there is a self-adjoint operator A 
such that 


(a) Da Cc D(s) and s(zx,y) = (x, Ay) for any x € D(s), y€ Da, 
(b) the domain Dy is a core for s, 


(c) if there are vectors x € D(s) and z € H such that s(x,y) = (z,y) holds for 
all y € D(s), then x € Dy and z= Az, 


(d) let T be a linear operator such that Dr C D(s) and (Tz, y) = s(x.y) for all 
xz é Dr, y € D(s); then T C A. In particular, T = A if T is self-adjoint; 
hence the operator A is determined uniquely by condition (a), 


(e) A is below bounded with inf @(A) = m,. 


Proof: First suppose that m, = 0. Since s is closed, H, is a Hilbert space; for any 
y € H we define the antilinear functional l, : ly(x) := (x,y) on it. We have |\z|| < 
|z||,, so ly is continuous on H,, and by an easy modification of the Riesz lemma 
proof, there is just one y’ € H, such that (z,y) = (x,y’),, and the map C: yw 

is linear. In particular, setting + = y' we get |ly’|l2 < |ly'll llyll < ly'llsllyll, ie. 
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Cyl < ||Cylls < lly] so C € B(H). The relation (x,y) = (x, y’)s can be written 
as (x,y) = (x, Cy), = s(x, Cy) + (z,Cy), so s(x,Cy) = (x,y — Cy) holds for any 
x € D(s) and y€H. Hence if Cy=0 we have (z,y) = 0 for all x € D(s), and 
since this set is dense in #, , this further implies y = 0, so C is invertible. We also 
have (Cu, v) = (Cu, Cv), = (Cv, u) = (Cu, Cu), = (u, Cv) for any pair u,v € H, 
which means that C™! is self-adjoint by Problem 3c. Now denoting z = Cy in 
s(x, Cy) = (x,y — Cy) we find that the self-adjoint operator A := C™!—TI satisfies 
8(z,z) = (x, Az) for all x € D(s) and z € D4 = RanC c D(s). Finally, in the 
general case when m, # 0 the sought operator A equals Ap + m,I, where Apo 
corresponds to the positive form so := s — mee ; this proves assertion (a). 

To check (b) it is sufficient to show that the subspace D4 = RanC is dense 
in H, (Problem 38d). Since ||z||? = (z,Cx),, any vector from the orthogonal 
complement of RanC in H, is zero, and therefore (D4), = H,. Assertion (c) 
is obtained if we write the condition s(z,y) = (z,y) for y € Da in the form 
(oyAy)= (z, y); thisimplies + <¢ D(A*) = D, and z ="A*x = Ax. Finally, (d) 
follows from (c), and (e) from (b) in combination with Theorem 5. § 


4.6.9 Corollary: The representation theorem defines the bijection s+» A, from 
the set of all densely defined, closed symmetric below bounded forms to the set of 
all self-adjoint below bounded operators; A, is bounded iff s is bounded. 

Proof: Let A, = Az and denote D := D(A,) = D(A:), then s)D =t}D and 
injectivity of s+» A, follows from Theorem 8b. To check the surjectivity consider 
the form s,4 generated by a below bounded self-adjoint operator A ; by Proposi- 
tion 7 it has a closure u := 3,4 and u(z,y) = (x, Ay) for all x € D(s4), y E Da, 
so A= A, by Theorem 8d. § 


Operator A, is said to be associated with form s. If A is a below bounded 
self-adjoint operator and s is the corresponding form, A, = A, then D(s) is called 
the form domain of operator A and is alternatively denoted as Q(A). 


4.6.10 Example: The positive form p of Example 1b is densely defined and closed 
if SE L,(H). We have (Sx, Sy) = (x, Apy) for all x € Dg and y € D(A,) C Dg, 
which means Sy € D(S*) and S*Sy = Apy, or Ap C S*S. However, S*S is self- 
adjoint by Theorem 4.2.7, so we have A, = S*S, and D(S) is the form domain of 
operator Ap. 


The representation theorem provides a way to construct a self-adjoint extension 
to a given symmetric operator. 


4.6.11 Theorem (Friedrichs’ extension): Suppose that Ao is a symmetric below 
bounded operator, s is the closure of the form generated by Ay, and A, is the 
self-adjoint operator associated with s; then A, > Ap and inf O(A,) = inf O(Apd) . 
Moreover, A, is the only self-adjoint extension of Ap such that Da, C D(s). 

Proof: By Proposition 7, the form so generated by Ag has a closure, s := 39, and 
s(x, y) = (x, Ay) holds for all x € D(s) and y € D(Ao) = D(so) , so Ap C A, and 
inf O(A,) =m, = m,, = inf O(Ao) due to Theorems 5 and 8e. Now let A be a self- 
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adjoint extension of Ao fulfilling the stated conditions, and denote by t the form 
generated by A. Since t D s, we have ? D s and A = A; by Proposition 7 and 
Theorem 8d. Then s(z.y) = t(r,y) = (x. Ay) holds for any xr € Dis). y € Da: 
using Theorem 8d again, we get 4 C A,, i.e. A= A, because the two operators 
are self-adjoint. 


4.6.12 Remark: Given two below bounded self adjoint operators A, and A, we 
denote by s, and so the corresponding forms, A, := Ay, for r = 1.2. They are 
densely defined, symmetric, below bounded, and closed; the same is true for the form 
$ := $8, + Sq with the exception that it may not be densely defined (Problem 38e). 
If s is densely defined, then by the representation theorem there is a unique self- 
adjoint operator A, which we call the form sum of A; and Az ; sometimes it is 
denoted as Ay+A2. Since D(A, + Ag) C D(S;)M D(s2) = Dis) and siz.y) = 
8;(X, y) + So(x.y) = (x. (Ai + Ao)y), we have A; + dp C As, by Theorem Sd. te.. 
the form sum extends the usual operator sum: notice that A, + 4, may even net 
be densely defined. Moreover, if A, + Ag € C(H) it is symmetric and therefore 
has the Friedrichs extension Ag: however, it ditfers in general from the form sum. 


Ar # A, (Problem 40). 


The result about stability of self-adjointness proved at the end of Section 3 has 
its form counterpart. To formulate it. we have to reintroduce the netien of relative 
boundedness. Suppose that s.f are symmetric forms on H and s is belew bounded. 
then we say that t is s-bounded if D(s) C Dit) and there are non-negative a.6 
such that 

|t(x}| < als[z)| + bl)x|)? (4.6) 


holds for any r € D(s): the infimum of all a for which the conditien is valid with 
some 6 is called the s-bownd of the form t. Sinee an s-bounded ferm is at the 
same time bounded with respect to s+ce for anv c € R and the relative bound is 
the same. we may assume without loss of generality that s is positive. 


4.6.13 Lemma: Let s.t be symmetric forms. If s is positive. Dis) C Dit). and 
condition (4.6) holds for some a < 1, then 


(a) s+t is below bounded with m,,; > —b. 
(b) s+¢ is closed iff s is closed, 
(c) 8 +t is closable iff the same is true for s and D(s +f) = Dis). 


Proof: It follows from the assumption that (s + f)[2] > (l-a)siz) = dl}? helds 
for any xr € Dis +f), 80 Moye > —b. In a similar way. we find is = e)fel) < 
(a+ 1)sjr] + blr? and siz] < (1-a)-' (lls + Die) + be). These inequalities 
show that conditions x, —> r and x, ““* x are equivalent: assertions (b).(c) then 
follow from Proposition 2 and Theorem 5. Jf 


Combining this result with the representation theorem, Corollary 9 and Theo- 
rem 5, we obtain the mentioned counterpart of Theorem 4.3.12. 
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4.6.14 Theorem (KLMN-theorem): Let A be a self-adjoint below bounded op- 
erator and s,4 the form generated by it. If a symmetric form ¢ fulfils condition 
(4.6) for s =s,4,some a <1 and any x € Dy, then there is a unique self-adjoint 
operator A associated with me form $4 +t. The form domains of the operators 
A, A coincide, and 

(x, Ay) = (a, Ay) + (x,y) 


holds for all x € Q(A), y € Da. Moreover, the operator A is below bounded, 
inf O(A) > (l—a)m, — b, and any core of A is at the same time a core for A. 


In particular, if the form t is generated by a symmetric B, then A is the form 
sum of the two operators, and by Theorem 8c, it coincides with their operator sum, 
A := A+B =A+B. Such a result follows from the Kato-Rellich theorem with- 
out the requirement that A is below bounded; however, the relative-boundedness 
assumption in Theorem 14 is weaker than in the operator case. 


4.6.15 Proposition: Suppose that A is a self-adjoint below bounded operator, 
and B is symmetric and A-bounded with the relative bound a < 1. Let s,t be the 
forms generated by A and B, respectively; then ¢ is s—bounded with the relative 


bound <a; if t is closable, the same is true for the closures ¢ and &. 


Proof: To any a > a, there is a @ such that ||Bz|| < al|Ax|| + Gl|z|| holds for 
all z € Dg. By Theorem 4.3.12 and Problem 22, the operator A—K«B is self- 
adjoint for any x € (—a7!,a7"), and its lower bound m(«) := inf@(A—«B) 
can be estimated from below by ma—|k| max{@G(1—al«|)~}, G+ a|ma|}. Then 
Kt[z] = w(x, Bx) < —m(k)||x||? + (z, Ax) for any x € Da, and changing « to 
|x| sgn t[z] we get the estimate 


eta < Yast + (— TA + max { Fo, alma +3) fel? 


for all x € D(s), which shows that ¢ is s-bounded and its relative bound does 
not exceed inf |«|~! =a. Since a can be chosen arbitrarily close to a, this proves 
the first assertion; the second follows from the above estimate, Proposition 7, and 
Theorem 5. & 


4.7 Self—adjoint extensions 


We have already on several occasions encountered the problem of finding self-adjoint 
extensions to a given symmetric operator, e.g., in Example 4.2.5 where we solved it 
by elementary means for the operator P on L?(a,b). In this section we are going 
to present a general method which allows us to construct and classify all self—adjoint 
extensions. 

We start with some properties of the regularity domain 7(T) of a linear operator 
T on H; recall that we defined it as a set of those 4 € (to which there is a c, > 0 
such that ||(T—A)z|| > callz|| holds for any x € Dy (see Problem 1.64). In 
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particular, if A is a symmetric operator then (A) > € \ IR by Problem 4, so it 
has at most two connected components (i.e., open arcwise connected subsets); it 
is connected if A is below bounded. Similarly an isometric operator V satisfies 
(V—A)z|| > [1 — JA} | |lzl], so t(V) DC \ {A eC: |A| =1} again consists of at 
most two connected components. 

The orthogonal complement to Ran (T—A) will be called the deficiency subspace 
of T with respect to A, and its dimension will be denoted by def(T—A). If T is 
densely defined, then def (T—A) = dim Ker (T*—X) by Proposition 4.1.1b. 


4.7.1 Theorem: The map +> def (T—A) is constant on any connected component 
of the regularity domain 7(T). 

Proof: Let G be a connected component of 7(T), and for any 4 € G denote by 
E) the projection onto Ran (T—.)+. By assumption, any two points 4,,A2 € G 
can be connected by a curve which is a compact set in ( (as a continuous image of 
the segment [0,1] ); hence it is sufficient to check that the map 4+ def (T—A) is 
locally constant. This is true if sup{ ||(I-E,)E,a||: 2 € H, ||E,x|| = 1} <1, and 
the same inequality with A,» interchanged are valid for all « in some neighborhood 
of the point (cf. Proposition 3.2.11 and Problem 3.18c). Since G is a part of 
the regularity domain, we have 


2 
I(P—w)al] 2 (T—A)all — [te—A)al] 2 5 eallel 
for |w—Al < ZC. The norm in question can be expressed by Problem 3.16 as 


(2 -£,)Eaal| = sup{|(Zax,y)|: y € Ran(T—y), |lyl] = 1}, and using the fact 
that E,x € Ran(T—A)+, we find 


(xz, (T—p)z)| 
\I-E,)E,z\| = s { el re Drs 0 
a I a)al isis 
|u—A\ [lel } 1 
S sup :z€Dr,z#0) < = 
{iPapay  FEDR=xOL <5 
for any x € H with {|E,x|| = 1. The same argument, with the above estimate 


replaced by ||(T—A)zx|| > callz||, yields sup{ ||([—E) EB, 2|| : x € H, ||E,zl| =1} < 
} , which concludes the proof. 


The open upper and lower complex halfplanes which we denote as ('* are 
connected components of the regularity domain for any symmetric operator A, so 
the function \ ++ def (A—X) assumes a constant value at each of them; we call these 
numbers deficiency indices of the operator A and denote them by nz, 


ns(A) := def (A Fi) = dimKer(A* +2). 


Since A’ = A*, one has na(A) = n+(A). By Theorem 4.3.9, a symmetric operator 
A is self-adjoint iff it is closed and ns(A) = 0 ; the condition na(A) = 0 is 
necessary and sufficient for A to be e.s.a. The deficiency indices are usually written 
as the ordered pair (n,(A),n_(A)). 
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4.7.2 Corollary: If the regularity domain of a symmetric operator A contains 
at least one real number \, then n4(A) = n_(A) = def (A—A) ; this is true, in 
particular, for any below bounded symmetric A. 


4.7.3 Example: Let us ask about the deficiency indices of operator P in Exam- 
ple 4.2.5. Since D(P*) = AC(J) and P*y = —iy’ , the problem reduces to finding 
the solutions to the equation ia’ + iy = 0 which belong to AC(J). Since the 
general solution is of the form wWs(r) = cxze** we have n,(P) =n_(P) =1 if J 
is a finite interval and n4(P) = n_(P) =0 for J = R. On the other hand, in the 
case of J = (0,00) only w_ belongs to AC(J), so the deficiency indices are not 
equal, n4(P) =0 while n_(P) = 1. 

' If A is a closed symmetric operator, then  € 7(A) is a regular value of A iff 
Ran (A—A) = H, i.e., Ker(A* — A) = {0}. Hence X € (+ belongs to o(A) iff 
n4(A) #0, and the analogous statement is valid for C~ . Taking into account the 
closedness of the spectrum, we arrive at the following conclusion. 


4.7.4 Proposition: (a) Let A be a closed symmetric operator; then just one of 
the following alternatives is valid: 


(i) (A) =C which happens iff both nz(A) are nonzero, 
(ii) o(A)=C\C~ if n(A) #0 and n_(A)=0, 

(iii) o(A) = C\ Ct iff ny(A) =0 and n_(A) £0, 

(iv) o(A) CR iff n,(A) =n_(A) =0. 


(b) A closed symmetric operator is self-adjoint if it has a real regular value. 


Any symmetric operator obeys the inclusion A C A*. The following theorem 
allows us to describe the domain of A* using D(A) and the deficiency subspaces. 


4.7.5 Theorem (the first von Neumann formula): Let A be a closed symmetric 
operator; then to any x € D(A*) there is a unique decomposition 


w2=Xt+x,+2., A*e = Ato—i(z,—-Z_), 


with x € D(A) and wz € Ker(A* +7). 

Proof: We denote Ks := Ker (A*+1). Let z be the projection of A*z—ix to Ky, so 
A*tz+iz =0. Since the operator A is closed, Ran (A—1) = Ran (A—i) = Ky; hence 
to y := Atx—ix—z there is an 2p € D(A) such that y = (A-7)z9 = (A*—i)z9. The 
decomposition A*x — ix = y+z then implies A* (x —2Xo- Sz =t(r%—-24%—- kz ; 
so the vector 2_ := x —2 — 4z belongs to K_. At the same time x, := iz € Ky 
which means that zo, 74 are the sought vectors. It remains for us to check the 
uniqueness; this requires to show that yo + y4+y-=0 with yo € D(A), ys € Kx 
implies yo = ys = y- = 0. Applying the operator (A* —7) to yot+y4+y_ we get 
(A—i)yo = 2iy,, but ys € Ky and (A—1)yo € Ran(A-—i) = Ky; hence y, = 0 
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and yo € Ker(A—i), which finally gives yo = 0. The second formula follows easily 
from the first. 


The obtained decomposition represents a starting point to construct self-adjoint 
extensions to a given symmetric operator. There are different ways to do that; we 
shall employ a simple trick which allows us to reduce the problem to constructing 
unitary extensions to an isometric operator. The inspiration comes from the function 
tir we) = i , which maps the real axis bijectively to the unit circle in C with 
the point z = 1 removed; the trick is based on an operator analogy of this relation. 
For brevity, we denote by £.5(H) the set of all closed symmetric operators on H. 


4.7.6 Lemma: To any A € L,,(H), the relation V4 := (A—i)(A+i)~' defines an 
isometric operator, and Ran (I — V4) = Da. 

Proof: The operators (A +i)~! exist, so V4 maps Ran(A+i) to Ran(A-—1) ; the 
two subspaces are closed since A is closed. If y € Ran(A+2) we have y = (A+i)x 
and Vay = (A—i)x for some x € Dag, and ||(A+i)z|| = ||(A—2)z||, which 
proves the isometry. Furthermore, the relation Ran (J — V4) = (J — V4)Ran (A+1) 
implies that y € H belongs to Ran(J — V4) iff there is x € Dy, such that 
y = (I — Va)(A+i)x = (A+i)z — (A—i)z = 2iz, so Ran(J-—V4)=Da. Bi 


We call C: At Vy, defined in this way the Cayley transformation; it maps 
L.s(H) to the set V(H) of all isometric operators on H with Ran(J —V)=H. 


4.7.7 Theorem: (a) The Cayley transformation is a bijection between £,,(#1) and 
V(H) with C7'(V) =i(7 + V)T-V)7. 


(b) The set £,,(H) of all self-adjoint operators is mapped by C to the set Uy, (H) 
of all unitary operators on # which do not have 4 = 1 as an eigenvalue. 


Proof: Let V € V(H) and suppose that Vy = y for some y € Dy. Since V is 
isometric, we have (x,y) = (z —- Vz,y) =0 for any x = (J—V)z with z€ Dy; 
then the condition Ran(J —V) = # gives y = 0, which means that J — V is 
invertible and Ay := i(1+V)(I—V)7! is a densely defined operator with the domain 
D(Ay) = Ran (I—V). We shall check that it is symmetric and closed. The relation 
(x, Avy) = (Ayz,y) for all z,y € D(Ay) = Ran (J —V) follows easily from the fact 
that V preserves the inner product. For any [x,y] € I'(Av), there is a sequence 
{zn} C Dy such that (I-V)z, > 2 and i(1+V)in > y, 80 % — 4(x — iy) and 
Vin > —3(x+iy). Since V is isometric, the subspace Dy is closed and the vector 
z:= 4(x — iy) belongs to it. Furthermore, V is continuous, so Vz = —h(x+iy) ; 
this yields x = (J -V)z and y=i(J + V)z, ie. [x,y] € P(Av). To prove (a), it 
remains to check that Ay = C7'(V). Let ye D(C(Av)), so y = (Av+)z for some 
xz € D(Ay), then there is z € Dy such that x = (I —V)z and Ayr =i(I+V)z. 
This implies C(Ay )y = (Av—i)z = i(1+ V)z-i(1 —V)z = 2iVz = Vy; the same 
argument in the reversed order yields the inclusion Dy C D(C(Ay)), so together 
we have C(Ay) = V. In the same way, we can prove the identity Agia) = A. 

If A is self-adjoint, then Ran(A +i) = H, so U := C(A) is unitary, and 
1 ¢ o,(U) due to the condition Ran(J -U) = H. It remains for us to prove 
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that Ay := C~!(U) is self-adjoint for any U € U,(H), i.e., that At C Av. Let 
y € D(Aj) so (y, Avx) = (y*,x) holds for some y* € H and all x € D(Ay) = 
(I — U)H; this is equivalent to the relation i(y, (J + U)z) = (y*,(I—U)z) for any 
z €H, which implies (J + U*)y = i(I — U*)y*. Now we have to apply operator U 
and add (I — U)y to both sides to get the expression y = $(I — U)(y—iy*) which 
shows that y€ Ran(J-—U)=D(Ay). § 


With these prerequisites we are now able to describe the above mentioned refor- 
mulation of the original problem in terms of unitary extensions of a given isometric 
operator. 


4.7.8 Proposition: Let A be a closed symmetric operator and V := C(A) its 
Cayley transform; then 


(a) an operator V’ > V is the Cayley transform of some nontrivial closed sym- 
metric extension A’ of A iff the following conditions hold simultaneously: 


(i) there are closed subspaces G4. C Ran (A+i)+ with dimG, = dimG_ > 0, 
(ii) D(V’) = Ran(A+i) @G, and RanV’ = Ran(A-i) 6G_, 


(iii) if «= y+z with y € Ran(A+7) and ze G,, then Vx = Vy+ Vz, 
where V is an isometric operator from G, to G_, 


(b) if the above conditions are valid and, in addition, dimG, = dimG_=d<oo, 
then n4(A’) = n.(A) —d, 


(c) operator A is maximal iff at least one of its deficiency indices is zero. 


Proof: By definition, D(V’) = Ran(A’+%) and RanV’ = Ran(A’— 1), and the 
subspaces Gz := Ran(A’+ i) Ran(A+ i)+ satisfy condition (ii). Furthermore, 
the isometric operator V :=V') Gx satisfies (iii); we shall show that RanV = ol 

We have (Vy, Vz) = (V'y,V'z) = (y,z) =0 for any z € Gy, y € Ran(A+2), s 

Vz € (V Ran(AH))+ = Ran (A~)-, one since Vz € RanV’ = Ran(A’-1) holds _ 
the same time, we get Vz € G_, i.e, RanV C G_. On the other hand, any vector 
z' € G_ C Ran(A’ —1) = RanV’ can, due to (ii), be written as z’ = V'(yo+2) 
with y € Ran(A+i) and 2 € G,. We have V’yo = Vy € Ran(A—1i) C Gt, so 
0 = (Vypo, 2’) = (yo, yot20) = ||yol|?, and therefore z' = V'% = Vz € RanV. Hence 
the isometric operator V maps G, on G_, which implies condition (a). On the 
other hand, conditions (ii) and (iii) in view of Problem 3.25 determine an isometric 
operator V’ > V = C(A); then V'=C(A’), where A’ := C71(V') € £,,(H) is an 
extension of A. 

To prove (b), it suffices to notice that # = Ran(A’+1)+ @ Ran(A+i) @Gu, 
so Ran(A+i)+ = Ran(A’+i)+ @Gx. Suppose finally, e.g., that n_(A) = 0; then 
Ran (A+i) = Dy =H, so V has no nontrivial isometric extensions, and therefore 
A has no nontrivial self-adjoint extensions. On the other hand, if both deficiency 
indices are nonzero then the above described construction yields an isometric exten- 
sion of V,so A cannot be maximal. f& 
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These results allow us to classify all symmetric extensions of a given symmetric 
operator A; without loss of generality we can assume that A is closed and discuss 
its closed symmetric extensions only. The following situations are possible: 


(i) At least one of the deficiency indices is zero; then A is a maximal symmetric 
operator which is self-adjoint iff n,(A) =n_(A) =0. 


(ii) Both deficiency indices are nonzero and n,(A) # n_(A) . Suppose for definite- 
ness that n,(A) > n_(A) ; then for any G, C Ran(A+i)*+ with dimG, < 
n_(A) there is a closed symmetric extension A’ > A. In particular, in the 
case Gy = Ran(A+i)+ the above construction yields a maximal isometric ex- 
tension V’ of V:=C(A) because D(V’) =H; the operator A’ := C71(V") 
then has Ran (A’ +7) = D(V’) =H, 2.e., n_(A’) =0. It is therefore a maxi- 
mal operator which is not self-adjoint because n,(A’) = n,(A) —n_(A) > 0; 
the operator A has no self-adjoint extensions. 


(iii) The deficiency indices are nonzero and equal each other, n4(A) = n_(A) =: d. 
Choosing Gz = Ran(A+i)+, we obtain a unitary extension V’ of C(A), so 
C-1(V') is self-adjoint. If d < oo, any maximal extension of A is self-adjoint 
by Proposition 8b. The situation is more complicated if the deficiency indices 
are infinite; then some maximal extensions of A are self-adjoint and some not. 
The last named case occurs, e.g., if G, = Ran(A+i)+ and G_ is a proper 
infinite-dimensional subspace in Ran(A—1)+ ; the construction then yields 
an operator V’ which is maximal isometric but not unitary, so C71(V’) is 
maximal but not self-adjoint. 


Let us summarize this discussion. 


4.7.9 Theorem: (a) A closed symmetric operator A has a nontrivial closed sym- 
metric extension iff both the deficiency indices nz(A) are nonzero. 


(b) The operator A has self-adjoint extensions iff its deficiency indices equal each 
other. 


(c) Let n4(A) = n_(A) =: d. If d < 00 then any maximal extension of A is self- 
adjoint; in the opposite case, there are non-selfadjoint maximal extensions. 


With a little effort, the construction of Proposition 8a provides an explicit ex- 
pression for symmetric extensions of a given operator A € Lo5(H). 


4.7.10 Theorem(the second von Neumann formula): We adopt the notation of 
Proposition 8, then for any y’ € D(A’) there are unique y € D(A) and ao € Gy 
such that 


y=yt+(I-V)mo, Aly = Ay+i(1+V)a0; 


the first formula allows us to write the domain of A’ ina direct-sum form, D(A’) = 
D(A) @ (I-V)G,.. 
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Proof: Since D(A’) = (I — V')D(V’), condition (ii) of Proposition 8a implies that 
for any y/ € D(A’) there are x € Dy = Ran(A+i) and zp € Gy such that 
y = (I —V')(x+2o) ; condition (iii) then gives (I — V’)z = (I - V)a € Dag and 
(I-V")ao = (I—V)zxo. Therefore denoting = (I—V)z, we get the first relation, 
and the inverse Cayley transformation, A’ = C~!(V’), yields the second, 


Aly = i(I+ Veta) = (+ V)et (+ V)to = Ay +i(l + V)20. 


The uniqueness of the decomposition follows from the invertibility of (I — V’): if 

y+(I—V)zo = 0 for some y € D4 and to €G,, then 0= (I—V)xr+(I-V) zo = 

(I — V’)(a+20) holds for x := (I — V)~!y € Dy, so x+29 = 0 ; however, the 

vectors z,Zo belong to mutually orthogonal subspaces; hence x = % = 0 and also 
=(I—V)z=0... fi 


If d := dimG 4 < oo, the formulas of the theorem can be given an even more 
explicit form. We choose orthonormal bases {e;}7_, and {f.}4_, in the subspaces 
G., G_ , respectively; then there is a bijective correspondence between isometric op- 
erators V : G,; + G_ and dxd unitary matrices (ujx) given by Ve, = Lege fy. 


Writing ro € Gy as Xp = Pe ding &€x , we get 
d fi d d 
y =yt>d &lee— Sd upefi| , A’y’ = Aytid & (ext+ >. upesj |. (47) 
kA we Si ga] 


In particular, these relations define, for an operator A € L,,(H) with finite defi- 
ciency indices (d,d), a map from the set of all dx d unitary matrices on the set of 
all self-adjoint extensions of A ; using the injectivity of the Cayley transformation 
together with Proposition 8a, we see that this map is bijective. 


4.7.11 Example (operator P revisited): We have found the deficiency indices in 
Example 3; according to it and Theorem 9, P is self-adjoint on J = R and 
maximal symmetric with no self-adjoint extensions for J = (0,00). If J = (a,b) is 
finite, the deficiency subspaces are spanned by the unit vectors 74: 7:(x) = ce** 
a 7. nalee= ce”, where c = ee a) eae ; then the isometric 
operators from G, to G_ correspond to complex numbers yp of the unit circle, 
|u| = 1. By Theorem 10, the domain of operator P() consists of the vectors 
w= ¢+&(ns —pn_) with de D(P) and € € C, on which it acts as 


Pippy = —id! + i€(ny + un_-) = —id! + 1€(—nl, + wn) = —iy’. 


It is easy to find the relation with the parametrization used in Example 4.2.5. Since 
o(a) = $(b) = 0, we have P(u) = Po with 6 := (e*-° — u)(1 — we*-°)-1,, where the 
last relation obviously describes a bijective map of the unit circle onto itself. 


Next we shall derive some simple spectral properties of self-adjoint extensions 
of a given closed symmetric operator A with finite deficiency indices (n,n). Let 
us start with the eigenvalues. Given 4 € €', we denote Na(A) := Ker(A—)) and 
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ma(A) := dim N4(A), so if A € op(A), then ma(A) is its multiplicity and Na(A) 
is the corresponding eigenspace; otherwise m,(A) = 0. 

We shall again consider closed extensions of A only. Let A’ > A be asymmetric 
extension, not necessarily self-adjoint. Obviously N4/(A) > Na(A) for any AEC, 
and due to the closedness of A, A’, both subspaces are closed. We denote by A(,) 
the orthogonal complement to Na(A) in Na(A), te. Na(A) = Na(A) AQ). 
Since N4(A) = Na(A) NM D(A), we have A(X) M D(A) = {0} ; on the other hand, 
A(A) @ D(A) c D(A’), so 


dim A(\) = dim(A(A) ® D(A))/D(A) < dim D(A)/D(A) < 1 


due to Problems 1.5 and 47; this shows how much the extension can increase the 
eigenvalue multiplicity: 


4.7.12 Proposition: Let A’ be asymmetric extension of an operator A € Le.(H) 
with finite deficiency indices (n,n). If A is an eigenvalue of A’, then ma-(A) < 
ma(A) +n. 


We shall show that if the deficiency indices are finite, the extension does not 
affect the essential spectrum. First, we prove an auxiliary result. Given an operator 
A € Lo.(H), and \ € €, we define A) as the part of A in the subspace Na(A)+. 
Obviously A, = A if A ¢ o{A) ; it is also easy to check that A) is a closed 
symmetric operator on the Hilbert space N,4(A)+, the resolvent (A,—A)7! exists, 
and Ran (A,—A) = Ran(A-—d). 


4.7.13 Proposition: Let A € £,,(H). A complex number 4 belongs to o¢,,(A) 
iff at least one of the following conditions is valid: 


(i) A is an eigenvalue of infinite multiplicity, 
(ii) the operator (A—A)~! is unbounded. 


Proof: The sufficient condition is easy in both cases. Suppose on the contrary that 
A is not an eigenvalue of infinite multiplicity, and at the same time (A —A)7! 
is bounded. Let {z,~} C Da, be an arbitrary sequence of unit vectors such that 
(A—A)x, — 0; to prove A ¢ oes3(A) , we have to show that it contains a convergent 
subsequence. We denote by y, and z, the projections of xz, to the subspaces 
Na(A) and Na(A)+, respectively; then (A,—A)zx = (A—A)ae — 0, and therefore 
also z, + 0 because (A—X)~' is bounded. On the other hand, {y,} is a bounded 
set in the subspace N4(A) which is finite-dimensional by assumption; hence there 
is a convergent subsequence {y,}, and the corresponding sequence {z,,} is then 
also convergent. & 


4.7.14 Theorem: Suppose that A, A’ € £,,(H) and operator A has deficiency 
indices (n,n). If A Cc A’, then a (A) C op(A’) and aeg(A) C oe.(A’) ; in 
particular, the essential spectra coincide, oess(A) = dess(A’), if n < 00. 

Proof: The inclusions follow directly from the appropriate definitions; it remains 
to check that A ¢ aess(A) implies \ ¢ oess(A’) for n < oo. In view of the two 
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previous propositions, 4 is not an eigenvalue of A’ with infinite multiplicity; hence 
we have to show that if (A—A)~! is bounded the same is true for (A’—A)-!. We 
denote L := (I —V)G,, then dim L = dim D(A’)/D(A) < n by Problem 47, so 
Ran (A’— A) = (A’—A)(D(A) @ L) = Ran(A—A) ®@ M, where dimM <n and 
M 1 Ran(A—A). Since the subspace Ran(A-—A) is closed the same is true for 
Ran (A’/—A) = Ran (A\—A), 1.e., the domain of the closed operator (A\—A)~! ; the 
result then follows from the closed-graph theorem. ff 


We shall conclude this section by proving a useful result which allows us to 
compare the extension resolvents directly. Let A,, Ag be self-adjoint extensions of 
a closed symmetric operator A. We call A the mazimal common part of Aj, A if 
it is an extension to any A such that Ac A; and AC Aj; it is clear that for any 
pair A,, Az the maximal common part always exists. 


4.7.15 Theorem (Krein formula): (a) Suppose that the maximal common part 
of self-adjoint operators A,, Ao has finite deficiency indices (n,n) ; then the 
relation 

(A:—z)"'— (Aa—2)? = So Ajn(Z) (ye(Z), +) ys (z) 
j,k=1 
holds for any z € p(Ai)M p(A2), where the matrix (A;,(z)) is nonsingular 
and y;(z), 7 =1,...,n, are linearly independent vectors from Ker (A*—z). 


(b) The functions A;,(-) and y;(-) can be chosen to be analytic in p(A,)M p(A2). 
In fact, we can define 


y;(z) = y5(z0) + (z—2)(A2—z)~"y5(%) 


for j = 1,...,n and z € p(A2), where % € ( \ R and y;(z0) are fixed 
linearly independent vectors in Ker (A*—zo) ; then 


(A(z1))j¢ = (A(Z2)) je — (21-22) (ye (Z1), ys(22)) 
holds for 21, z2 € p(A,)M p(Ag) and j,k =1,...,n. 


Proof: We denote the maximal common part as Ap and D, := (A;—-z)~!—(A2-z)!. 
If 2 = (Ap —z)y with y € D(Ao), the relation D,x = (Ra, (z)—Ra,(z))(Ao—z)y = 
y—y=0 gives D,x =0 for an arbitrary x € Ker(A}—Z) = Ran(Ay—z). On the 
other hand, we have (y, D,r) = ((Ra,(Z)—Ra,(Z))y, xz) = 0 for y € Ran (Ap-Z) and 
x € Ker(Aj—z), which means that D,xz € Ker(Aj—z) for x € Ker(Aj—Z). These 
subspaces are by assumption n-dimensional, so it is possible to choose orthonormal 
bases {e;(z)}%_, and {e;(Z)}%., in Ker(Aj—z) and Ker(Aj—Z), respectively, and 
to express the action of D, as 


Dep ¥ s(a)es(2) 
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for any x € H. The functions c;(-) are linear functionals which fulfil le;(a)| << 
v7, Ice(x)| < ||Dz2||, and since D, is bounded for z € p(A1) M p(A2), they are 
bounded. Then the Riesz lemma implies the existence of vectors f;(z),j7 =1,...,, 
such that c,;(x) = (f;(z),z). Now the vectors e;(z) are linearly independent, so 
D,x = 0 for x € (Ker(A%—Z))+ implies c;(z) = 0,7 = 1,...; hence f;(z) € 
Ker (A*—Z) for 7 = 1,...,n and we can express them using the basis {e;(Z)}}_1 
as f(z) = Dhar Age(z)ex(Z) , which gives 


Dye = J Ayu(2)(ex(2),2)e5(2). 


jk=1 


Suppose that the matrix—valued function (A;.(-)) is singular at some zo € p(A1)N 
p(Az) ; then the vectors f;(zo),j = 1,...,n, are linearly dependent so there is 
y € Ker(A§—z) perpendicular to all of them. This would mean D,,y = 0 ; in 
other words, (A;—2)~!y = (A2—zo)~!y , which further implies y € D(A,)N D(A2), 
and Ayy — Aey = (Ai — 2)D2,(A2 — 20)y = 9 by the second resolvent identity in 
contradiction with the assumption that Ag is the maximal common part of Ay, Ao. 
The proof of part (b) is left to the reader (Problem 52). & 


4.8 Ordinary differential operators 


In fact, the title of this section is too bold. We shall be concerned with a particular 
class of symmetric second-order operators corresponding to the formal expression 


fo — +V(z) (4.8) 
on an interval J := (a,b) C IR; however, most of the results and methods discussed 
here can be extended to a much wider class of ordinary differential operators (see the 
notes). The real-valued function V is assumed to belong to the class Lioe(a,b) of 
locally integrable functions, 7.e., integrable on any compact interval K C (a,b). We 
shall present a universal procedure which generalizes the argument of Example 4.2.5 
and associates with the formal expression (4.8) a pair of operators H and H on 
L?(a,6) such that His the maximal operator corresponding to @ and H isa closed 
symmetric operator with H* = H. In cases when the two operators do not coincide, 
we shall discuss self-adjoint extensions of H. 

The formal operator (4.8) is regular if b -—a < oo and V is integrable on 
L(a,b) ; otherwise it is singulur. In the same way we can classify the endpoints, 
€.g., a is regular if a > —oo and V is integrable on L(a,c) for some c > a and 
is otherwise singular; @ is clearly regular iff both the endpoints a,b are regular. 
By Je we denote the interval (a,b) amended by the regular endpoints, for instance, 
Je = (a,b) if the left endpoint is regular, etc.; the function V is by definition locally 
integrable in J;. 

Given an interval I C JR we denote by ac(I) the set of all functions f : 
IC such that f and f’ are absolutely continuous in J. It follows from the 
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definition that any function f € ac(J) has a second derivative a.e. in I which is 
locally integrable. Returning to the formal operator (4.8) we see that if f € ac(Je), 
then the function z + (é[f])(x) is defined a.e. in Jz and is locally integrable 
there. The absolute continuity allows us to perform integrations by parts which 
play an important role in the theory (Problem 53); in this sense ac(Je) is the 
natural (maximal) domain of the operator (4.8). 

Let g € Licc(Je) ; then a function f € ac(Je) is said to solve the differential 
equation é[f] = g if (é[f])(x) = g(x) holds ae. in J. Using standard methods 
(see the notes) one can prove the following existence and uniqueness result. 


4.8.1 Theorem: Let & be the formal operator (4.8) and g € Lig-(Je) ; then for 
any ce J and arbitrary complex yo, y, there is just one solution of the equation 
£[f] = g which satisfies the boundary conditions f(c) = yo and f’(c) = 


To a given formal expression ¢, we define the operator H on L*(a,b) by Hy := 
£[x)] ; its domain is the subspace 


= {b: peac(k), ly] € L’(a,6)}. 


The Lagrange formula of Problem 53 can be used for [c,d] C Je, and moreover, 
the limits [4,Jo = limpsos[d,V]e and [6,d) := lime .s[¢,U]e exist for any 
¢,w € D, even in the cases when the endpoints are singular or the one-sided limits 
of the functions ¢,¢’,w,~’ in them make no sense; this follows from the fact that 
L[d|b — dlfb] € L1(a,b). This yields the relation 


(Hd, 4) — (¢, HY) = (4, do — [b, Ya (4.9) 


for any ¢,W € D, which shows that H is not symmetric. 

Following the program mentioned in the introduction, we are now going to con- 
struct operator the H. We begin with the regular case where we define it as the 
restriction H := H} D to the set 


= {PED: ¥(a)=Y (a) = ¥(b) = Y'(b) = 0}. 
To check that H has the required properties we need an auxiliary result. 


4.8.2 Lemma: (a) If ¢o, ¢: are linearly independent solutions to the equation 
£[¢| = 0, then Ker H = (Ran H)* = {¢0, di}iin. 


(b) For any 7,71, 60,61 € €, there is a function ¢ € D such that g(a) = % 
and ¢ ")(b) =6,,r = 0,1, where we have denoted 6 := ¢ and ¢{ := ¢’. 


(c) If the relation (¢, Hw) = (n, W) is valid for some ¢,7 € L?(a,b) andall ye D, 
then ¢€ D and n= Hd = 4d]. 


nei The relation Ker H = {¢0, Pi }un is evident. Suppose that 7 € Ran FH, ie., 
= ¢/y] for some w € D ; then the Lagrange formula together with the eoriditionts 
(a) = )(b) =0,r = 0,1, gives [?7(x)d(x) dx = 0 for any ¢ € KerH, so 
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Ran H c (Ker H)+. On the other hand, let 7 € (Ker H)* ; then Theorem 1 implies 
the existence of ~ € ac{a,b] such that é{y] = n and y(a) = z’(a) = 0. Using 
the last conditions in combination with the Lagrange formula, we get 0 = (7, ¢,) = 
(E[}, Or) = [b, drlo, 7 = 0,1. Hence it is sufficient to choose the functions 9.1 so 
that ${(b) = 6,, ; then u(b) = ¥/(b) = 0, s0 WE D and n = fy] = He RanH. 

To prove (b), first consider the case yo = 71 = 0. Using the pair of linearly 
independent solutions which satisfy the conditions ${)(b) = 6,,, we construct the 
function 7 := ko¢o + ki¢,. The constants ko, k, can be chosen (uniquely) so that 
(¢o,n) = —6, and (1,7) = 6. Now 7 € ac{a,b] C L'(a,b), so by Theorem 1 
we can find dy € ac{a,b] such that = = 7 and dela) = O(a ) = 0 ; at the 
same time &[¢s] € L?(a,b) so ¢, € D, and the conditions ¢{)(b) = 6,. imply 
finally 6; = —($0, €[s]) = $4(6)_ and bo = (¢1,4[¢o]) = Go(6). In the same way 
we construct the function ¢, € D which satisfies the conditions ¢!”)(a) = 7, and 
o\(b) = 0; then $:= ¢. + dp is the sought function. 

As for (a), since 7 € L'(a,b) there is ¢ € D such that £(¢] = n. We have 
yw € D, so the Lagrange formula implies (n,w) = (¢, Hw). Since the left side is 
equal to (¢, Hw) with the help of the already proven assertion (a) we get @-CE 
(Ran H)+ = (Ker H)++ = Ker AF ; the last identity follows from the fact that Ker H 
is finite-dimensional. Hence ther’ are a,3 € € such that d= € + ado + 3q@; ; this 
implies ¢ € D and @¢])=4[C]=n. OH 


4.8.3 Proposition: If the formal opasaier fe 8) is regular, then H := H} D isa 
closed symmetric operator such that H* ; it has deficiency indices (2,2). 
Proof: Let 7 € D+; then Lemma 2c with é = 0 gives 7 = £(g] = 0, s0 A is densely 
defined. The same assertion implies H* C H, while the opposite inclusion follows 
from (4.9); together we get H* = H. To check the closedness, we have to prove 
H™ CH. Since H is symmetric, it satisfies H** C H*, and therefore H**y = €[y'] 
on its domain. We also have (w=, H*¢) = (H**w,@) for any @ € D(H*) = D(A), 
, (%, 4[@]) — (€[¥],¢) = 0 so the Lagrange formula implies [w,@], = [w,@]a. 

By ‘lean 2b, the boundary values of ”) are arbitrary; hence the last identity 
gives W')(a) = y(b) = 0 for r=0,1,4.€., we D. Finally, the relation H* = H 
means that the deficiency indices equal den number of linearly independent solutions 
to the equation é[¢] = +i¢ ; by Theorem 1, there are just two such solutions in the 
regular case. §§ 


The singular case is more complicated; operator H is now constructed in two 
steps. In the first step, H has to be restricted to a sufficiently small domain; it is 
natural to use the “minimal” restriction H := H}D with 


= {%e€D: suppy C (a,b) is compact }. 


4.8.4 Proposition: Operator H is symmetric and H* = H ; its deficiency indices 
are (n,n), where O< n< 2. 


Proof: Let (¢, Hw) = (n,~) for some ¢,n € L?(a,6) and all ~ € D ; we have 
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to show that ¢ € D and n = Hd. On a compact interval K := fexdren(apd), 
the formal expression @ is regular, and due to the previous proposition it defines 
a closed symmetric operator on L*(K) which we denote as Hx. We take any 
~ € D(Hx) and extend it to (a,b) putting ~(x) := 0 for x € (a,b) \ K ; then 
(Hv)(x) = (Hxw)(z) ae. in K and (Hx)(z) = 0 outside this interval, and 
therefore (¢, Hxw’) = (n,w)x . By Lemma 2c, the functions ¢ and ¢’ are absolutely 
continuous in K and n(x) = (é[¢])(x) a.e. in K, and since K was arbitrary, we 
get ¢ € D and n = H¢ ; it allows us to mimick the argument of the previous 
proof. The deficiency indices are again given by the number of linearly independent 
solutions to £[¢] = -i¢ which belong to L?(a, 6). In distinction to the regular case, 
the last condition is not fulfilled automatically; we only know that there are at most 
two such solutions and the deficiency indices equal each other by Problem 49. 


The sought closed symmetric operator generated by @ is now defined as H := H. 
It follows that H* = H* = H and H* = H, and these conditions in turn show 
that in the regular case both definitions of the operator H coincide. Its domain is 
D(H) := D ; it can be equivalently characterized by means of the boundary form 
(4.9) — see Problem 55. 


4.8.5 Example: Consider the simplest case when V = 0, i.e., £ = —d*/dz? ; 
write T, Dr, etc., instead of H, D, respectively. By Bonen 13, the subspace ‘ii 
coincides with ae domain of the operator P? so T = P?. In the case J = R, the 
operator P is self-adjoint, so T = T* = (P?)* c (P*)? = P? =T;; in conic 
with T Cc T this relation shows that the operator T on L?(JR) is pallial 

If J = (0,00), the formal operator ¢ is singular with one regular endpoint 
at zero. In view of Problem 55, the domain Dr now consists of all » € Dr = 
AC?(0,00) which fulfil the condition [¢,y%]o = limz.o[¢,¥]2 for any ¢ € Dr; 
however, ¢’ € L?(0,00), so the function z + [¢,p]z belongs to L1(0, 00) for all 
OWE Dr, which means that limz.00[?, Yl2 = 0. Hence ~ belongs to Dr iff 
$(0)~'(0) = #(0)w(0) for all ¢ € Dr, and using Problem 54a, we finally get 


= {ye AC*(0, 00): $(0) = ¥'(0) = 0}. 


The equation —f” = if has two linearly independent solutions, fs(x) := e 
where € := e~7*/4, but only f_ belongs to L?(0,00) ; hence the deficiency indices 
are (1,1) in this case. 


sex 


If é is singular with one regular endpoint, the conclusions of Proposition 4 can 
be strengthened in the following way. 


4.8.6 Proposition: Suppose that the formal operator (4.8) has just one regular 
endpoint; for definiteness, let it be a. Then 


(a) the domain D(H) consists just of the vectors ~ € D which satisfy the condi- 
tions (a) = #’(a) = 0 and [¢,%], =0 for any ge D, 


(b) the deficiency indices of H are (1,1) or (2,2), 
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(c) operator H has no eigenvalues. 


Proof: To prove (a), we have to check that the stated conditions are equivalent to 
those of Problem 55. Since a is a regular endpoint, ~(a) = y’(a) = 0 implies 
(¢,0], = 0 for any ¢ € D, and therefore (9, ~], — [¢,¥]a = 0. The opposite 
implication requires us to check that the last condition gives y(a) = y’(a) = 0. 
By Problem 54a, we can choose for any c € (a,6) ~~ ¢o,¢: € D such 
that (a) = 6, for r,s = 0,1 and ¢,(x) = 0 for z € (c,6), so [¢, Hh), = 
limz5-[¢r,V]2 = 0, i-e., [¢r, Plo = 0, which implies ~(a) = od = 0 in view of 
the conditions @{°) om =p 

Now denote the deficiency indices of H by (n,n). The first von Neumann 
formula implies that the dimension of the subspace Ker (H*+7) © Ker (H*—1) is 
dim D/D = 2n, so it is sufficient to check that the factor space D/D contains at 
least two linearly independent vectors. Consider the equivalence classes o, corre- 
sponding to the functions $, € D which satisfy the conditions ¢)(a) = 6,,, and 
suppose that Go¢0+Ai¢1 = 0, i.e., that there is ~ € D such that Sodot+ fdr = w. 
Assertion (a) then implies #(a) = w’(a) = 0, which together with the conditions 
#9)(a) = br, yields Gy = G, = 0 thus proving assertion (b). Finally, the rela- 
tion Hy = AW means that w solves the equation é(y| = AW with the boundary 
conditions ~(a) = y/(a)=0,so ~=0 by Theoreml. § 


This result allows us to treat the general singular case using the splitting trick 
which we are now going to describe. Given a singular expression & on (a,b), 
we choose a point c € (a,b) and consider ¢ on the intervals J; := (a,c) and 
Jz := (c,b). Let H, be the corresponding symmetric operators on L?(J,) as 
described above; it is clear that the direct sum A, ®@ Hp is (up to the natural 
isomorphism) a restriction of H. The operator H, ® Ho is symmetric and its 
closure fulfils Hy ® Hp = Hy @ Ho, where H, := H, (Problem 29c); moreover, the 
inclusion H, @ H2 C H implies H, @ H2 C H. The direct sum Hy © Hz is called 
a splitting of operator H . 

The splitting H;@ H2 provides a way to determine the deficiency indices of H 
from ns(H,), r= 1,2. To demonstrate this, let us first prove the relation 


dim(D(H)/D(H, ® H2)) = 2 


It follows from Proposition 6a that %(c) = ¢'(c) = 0 holds for any w € D(H, @H2). 
Furthermore, there are functions ¢o,¢, € Dy such that ${")(c) = 6,, for r,s = 0,1; 
to see this, it is sufficient to take a closed interval [cy,co| C (a,b) containing the 
point c and to construct by Problem 54a a square integrable function which fulfils 
the boundary conditions at c and equals zero outside of [c,,c2]. The subspace 
G := {¢0, bi}tin is two-dimensional because the Wronskian W(¢o,¢1) = 1 at the 
point z=c, GN D(H; & H2) = {0} and G@ D(H, @ Ha) C Dy ; it remains for 
us to prove that the last inclusion is in fact an identity. 

Let 6 € Dy and w := $- $(c)do — ¢'(c)d1, so W(c) = w'(c) = 0; in view i 
Proposition 6a we have to check that [~,mJa = 0 for all 7 € D, and idaalve = 
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for all 2 € Dy. We use Problem 54a again; due to it, we can extend a given 
m € D, to a function 7 € D which is zero in the vicinity of the point b. Then 
[¢, nb = 0 and by Problem 55, 0 = [¢, nla = [¢, m]a. Combining this with the fact 
that ¢o,¢ are zero in the vicinity of a, we get [~,mJa =0. In the same way we 
can check [w, 72], = 0, so together we get ~ € D(Hi ® Ho). 

We know that H and H; @ Ho are closed symmetric operators; hence the just 
proved relation, in combination with Proposition 4.7.8b, the second von Neumann 
formula, and Problems 46 and 55, yields the identity 


n4(H) = nz(A; @ He) —-2 = na(Ai) + n+(He) —2 


which represents the desired expression of the deficiency indices of operator H by 
means of those of H, and H2. Since c is a regular endpoint for both the last 
named operators, we have 1 < ni(H,) < 2 so, for instance, H is self-adjoint iff 
na(H-) = 1 for r= 1,2. 

It is often not easy to find the deficiency indices for a singular @ directly; how- 
ever, there is a variety of alternative methods using the behavior of function V. 
One of them will be mentioned below; we shall return to this question in a more 
general context in Sec.14.1. The splitting trick allows us to reduce the problem 
to investigation of the formal operators with one regular endpoint, which have the 
following remarkable property. 


4.8.7 Theorem: Let the formal operator (4.8) on (a,b) have one regular endpoint; 
then the following conditions are equivalent: 


(a) the deficiency indices of the operator H are (2,2), 
(b) to any \ € C, all solutions of the equation [7] = AW belong to L?(a,b), 
(c) there is A € JR such that all solutions of ¢{)] = Ax% belong to L?(a,b). 


Proof: The most difficult part is to show that (b) follows from (a). The implication 
is evident for \ € ( \ IR; hence we consider 4 € IR. Let the functions ¢1, 2 € D 
form a basis in Ker (H*—i) ; their Wronskian is constant (Problem 54b) and with a 
proper normalization we can set W := $165 — ¢,¢2 = 1. Let w be any solution to 
£[)] = AW and define g := (A—7)#, so the equation can be rewritten as £[y] — ip = 
g. Assume for definiteness that a is the regular endpoint, ~ € ac{a,b), and the 
function g is locally integrable in [a,b). In view of Theorem 1 and Problem 54b, 
w obeys, for any a1,Q2 € (, the equation 


U(E) = ayds(z) + anda(e) + (A—2) [ [rl dau) — bn(e)ou(u) (0) dy 


a.e. in (a,b). Denoting ¢ := |¢i| + |¢2| and m := max{|ai|,|a2|} we get the 
inequality 


Wa)? < Amon)? +2| ral ote) [ de)lvlav] 
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Next, we use the conditions ¢ € L?(a,b) and w € ac(a,b); the second of these 
implies ~ € L*(a,xz) , so using the Hélder inequality we get 


W(a)l? < 2mg(a))? + Mole)? [° ww)P ay, 


where we have denoted M := 2|\—i|? f° 6(x)?dz. Due to the absolute continuity of 
the integral, there is a c € (a,b) such that f° o(z)*dx < siz ; then 


ff [oc [ woreay] de < [order [ wonPay < x [ worlay. 


In combination with the above estimate, this gives f? |~(x)|?dx < 2m? f? ¢(x)?dx+ 
3 Je (x) |?dar , 80 


[wePae < am? [' 6(e)%de+ f We)Pae, 


which means that # € L?(c,b) due to Fatou’s lemma, and since ~ € acla,b), we 
finally get  € L*(a,b). 

Now (c) is a trivial consequence of (b), so it remains to prove that it implies 
(a). By Proposition 6c, H has no eigenvalues, so we can use Problem 47c and 
Proposition 6b which give def (H—A) < ns(H) < 2, and since def (H—A) = 2 by 
assumption, the proof is completed. & 


4.8.8 Remark: Combining Theorem 7 with Proposition 6 and the splitting trick, 
we get the so-called Weyl alternative for solutions of the equation é{¢] = Aw cor- 
responding to the formal operator (4.8). At each of the endpoints just one of the 
following possibilities is valid: 


(i) for any » € C, all solutions are square integrable in the vicinity of the endpoint, 


(ii) for any A € C, there is at least one solution which is not square integrable at 
the vicinity of the endpoint. 


Moreover, in case (ii) for any A € ('\ R there is just one square integrable solution 
which is unique up to a multiplicative constant. 

Notice that the alternative includes those cases with regular endpoints for which 
(i) is trivially valid. Possibility (i) is called the limit-circle case and corresponds to 
the deficiency indices (2,2), while (ii) is the limit-point case with deficiency indices 
(1,1) ; this terminology was introduced by H. Wey]. 


To conclude the section, let us present a simple result indicated above, which 
illustrates one way to determine the deficiency indices from the behavior of the 
function V. 


4.8.9 Proposition: Let ¢ be the formal operator (4.8) and suppose that the func- 
tion V is below bounded on J; then 
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(a) if J = (a,00) or J = (—oo,b) and a or b, respectively, is a regular endpoint, 
then the deficiency indices of the corresponding operator H are (1,1), 


(b) if J = IR, the operator H is self-adjoint. 


Proof: In case (a) it is sufficient to show that the equation é[y~] = 0 has at least 
one solution which does not belong to L?(J). Such a solution exists if y’ € L?(J) 
follows from the conditions ¢[y| = 0 and w~ € L?(J), since the assumption of 
existence of linearly independent solutions ¢%1, 2 to €[¢] =0 implies then that the 
function W := %,w05— v2 equals a nonzero constant on J, and at the same time 
W e€ L}(J), which is impossible because the interval J is unbounded. Suppose 
therefore £[7)] = 0 for some nonzero ~ € L?(J) ; the function V is real-valued, so 
we can assume without loss of generality that the same is true for 7. By assumption, 
Vu? = We, which yields for any xz € (a, 00) the estimate 


vi(av(z) > v'(a)pla)+ [ w@)*dy+ Vo [ vw)?dy, 


where Vo := infzey V(x). Hence y ¢ L?(a,00) implies limz_,.. y'(x)y(x) = +00, 
so the function 7(-)? is growing as x — oo ; however, this contradicts the condition 
w € L?(a,00) ; the proof for J = (—0o,b) is the same. Assertion (b) now follows 
from (a), by which the corresponding operators Hz on the half-axes have the 
deficiency indices (1,1), and the splitting—trick formula which yields na(H) = 
ns(H,)+nsi(H_)-2=0. IF 


4.9 Self—adjoint extensions of differential opera- 
tors 


Now we are going to continue the discussion started in the previous section by inves- 
tigating self-adjoint extensions of the closed symmetric operators H corresponding 
to formal expressions (4.8). It is clear that any such extension H’ is a restriction 
of the operator H , which means that it acts as a differential operator, H’¢ = ¢[d] 
for all ¢ € D(H’). 

We shall consider only the nontrivial case when at least one of the endpoints 
corresponds to the limit-circle case, so H is not self-adjoint. Since the deficiency 
indices are finite, n = 1 or 2, we can use formulas (4.7). Suppose that vectors 
ny € D, 1 <j <7, form an orthonormal basis in Ker (H*—i) ; then £[7;] = —i77,, 
so {7,}%_, is an orthonormal basis in Ker (H*+7). The self-adjoint extensions are 
parametrized by n x n unitary matrices u ; we have Hyd = é[¢| for ¢ € D(H), 
where the domain is defined by 


D, = D(Au) = {eed pees ES oe (nS » vie D, nee), 
k=1 j=l 
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4.9.1 Example: Consider again the operator T corresponding to the formal ex- 
pression —d?/dz?, now on an interval (0,b) ; for simplicity we set 6 := Vor. 
We shall construct a class of self-adjoint extensions; the general analysis is left to 
the reader (Problem 57). The deficiency indices are (2,2) and the functions 1; 
in the above formula can be chosen as 7(x) := ce and m2(x) := ce™-® , where 
c= 21/4(e2" _ 1)-1/2 and €:= e7™/4. “ 

The class in question will correspond to the matrices ug := a ( ry in ) , with 
9 from the unit circle in (. Each of these matrices has the eigenvalues 1, —7, 
so they are all unitarily equivalent. The domain Dy of the operator Ty := Tu, 
consists of the functions 


g=vtn (m . ahh = 55m) + (m 2 aim . sin) 


with ~ € D and 7,72 € €, but this formula is clearly not very transparent. An 
alternative is to use boundary conditions; computing ¢(0), ¢'(0), (0), ¢’(6) from 
the last formula we can directly check the relations 


$(b) = 89¢(0), $'(b) = 6¢'(0), 


where we have denoted 6 := (e” — 9)(Ye™ —1)~' (compared to Example 4.7.11, this 
represents yet another bijection of the unit circle onto itself). Using the first von 
Neumann formula, we can show that the converse is also true, 7.e., if d € D satisfies 
the stated boundary conditions, then it can be expressed as the above mentioned 
linear combination with some w € Dr and complex %. Hence Dy = {@«€ Dr : 
o(b) = 60(0), ¢'(b) = O¢'(0) } ; this means that Ty = P?, where Po are the self- 
adjoint extensions of the operator P from Example 4.2.5. 

This result allows us to find the spectrum of Ty. In view of to the relation 
Ts = P%, it has no negative eigenvalues; zero is an eigenvalue only if 6 = 1, 
i.e, 0 = 1. If A > 0 the general solution of the equation —¢” = Ad is TH 
> efVA2 + 6 e-tWre . substituting into the boundary conditions and introducing the 
real parameter d := + arg¥ (so that d+ J) maps the interval {0,1) bijectively 
onto the unit circle) we find that a solution contained in Dy exists iff A = Ax := 
2(k + d)?, k = 0,+1,+2,.... The corresponding normalized eigenvalues are , : 
dx(x) = b7'/2etV2(k+4)2 - in view of Example 2.2.2 these functions form a basis in 
L?(0,6), so the operator Ty has a pure point spectrum, o(Ty) = { (2(k+d)?: k= 
0,+1,...}. Moreover, the spectrum is simple if d € (0, 3) U G, 1), toy ln Z 0. 
In the case d= 0 the eigenvalue >» = 0 is simple while the other eigenvalues have 
multiplicity two; the same is true for every eigenvalue if d= 4 : 

This implies, in particular, that unitarily equivalent matrices u need not give 
rise to unitarily equivalent operators. In the present case all the matrices us are 
unitarily equivalent while the extensions corresponding to a pair of different d,,d2 
(0,4) have different spectra, so they cannot be unitarily equivalent. On the other 
hand, the operators Ta) and Tgi-a) for 9 € (0,1) have the same simple spectra, 
so they are unitarily equivalent (see the notes to Sec.4.4). 
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The moral of this example is that boundary conditions were much better suited 
to handle the extensions than the general formulas of Section 4.7. This suggests that 
we should see whether the same is true for a general H corresponding to the formal 
expression (4.8); a suitable starting point is the relation (4.9) — cf. Problem 58. 


Given a subspace L in a vector space V , we call vectors 21,...,2n € V linearly 
independent modulo L if no nontrivial linear combination of them belongs to L, 
t.e., if the condition }°_, a,x; € L implies ay =---=a,=0. 


4.9.2 Theorem: (a) Let (n,n) be the deficiency indices of the operator H. If 
the functions uw, € D, 1 < k < n, are linearly independent modulo D and 
[wre W5]4 — [we,wjla = 0 holds for 1 < j,k < n, then the functions ¢ € D 
which satisfy the conditions 


[we, blo — wr, Pla — 0, 1 < k = nN, 
form the domain of a self-adjoint extension of the operator H. 


(b) In this way we get all self-adjoint extensions; in other words, if a subspace 
D’ fulfilling D Cc D’ c D is such that H} D’ is self-adjoint, then there are 
functions uw, with the above described properties such that D’={gdeED: 
[wes blo — [we da=0, 15k <n}. 


Proof: By Theorem 4.7.5, D = D@®G, where G := Ker(H*—1) © Ker(H* +i) ; 
hence any w, can be expressed as uw, = Ye +7 with ~, € D and m € D. Using 
Problem 55 we find that w, can be replaced by 7, in the assumptions; we shall 
check that the subspace 


Do = {GE D: [we dle — [we dla =0, 1 Sk <n} 


coincides with D’ := D@{m,...,Mm}un- Problem 55 gives D’ C D, ; to prove the 
opposite inclusion, we complete {7} to a basis {n,}2", C G, so for any ¢ € D,,, 
there is a ~ € D and complex numbers (4,..., (an such that d= + 0%, Brrr. 
In view of the assumptions, we get then the system of equations 


a Br ([nes Mrlo = [Nes Mrla) = 0; i! < k < nN, 


r=n+1 


which is solved by @n41 = ++: = Gon = 0 only; otherwise a nontrivial solution to the 
system [nr 271 CkMk|y — [Mrs oka ChMk], = 0 would also exist for n+1<r< 2n 
but this would mean S~Z_, chm € D. Hence we get ¢ € D’, i.e, D’ = D, and 
using Problem 55 again we conclude the proof of assertion (a). 

The domain D’ of any extension is of the form D’ = D@L, where L is an 
n-dimensional subspace in G. Let {wx}Z_, be a basis in L, so the functions are 
linearly independent modulo D. Since u% € D’, the assumptions of part (a) are 
valid by Problem 55; it remains for us to prove that any ¢@ € D which satisfies the 
condition [wz, ¢], — [we, dla = 0, 1 << k <n, belongs to D’. To this end, we have 
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to complete the set {w,} to a basis in G and to repeat the argument from the first 
part of the proof. & 


Now we shall discuss in more detail some situations in which this result can be 
further simplified, so that we need not check that given functions u, are linearly 
independent modulo D. This happens, e.g., when formal operator @ is regular. 


4.9.3 Corollary: Let @ be a regular expression (4.8) on (a,6) ; then 


(a) any pair fi, fo of linearly independent functionals on D of the form 


fil) = anr(a) — an2G'(a) — Bero(b) + Beod' (0) , 


where the complex coefficients fulfil the conditions a%1@j2— an20j1 = Bur B52 - 
Bx23;, for j,k = 1,2, determines a self-adjoint extension A(fi, fe) of H 
with the domain 


D(fis fo) = {@€D: fel) =0, k= 1,2}, 


(b) to any self-adjoint extension H’ > H, there are linearly independent func- 


tionals f,, fe on D satisfying the above conditions and such that D(H’) = 
D(fi, fa) ‘ 
(2-3) x 


Proof: By Lemma 4.8.2b, there are functions w;,w2 € D such that vw, (a) = Ong 


and BY } oe Br; , so the functionals f,, fe can be expressed as 


ful) = [wr, db = [Wks Pa 


and w,W2 fulfil the assumptions of Theorem 2a. On the other hand, for given 
Ww, we € D which obey these assumptions, the last relation defines functionals f), fo, 
and we can readily check that they are linearly independent iff w;,we are linearly 
independent modulo D. § 


The functionals f, are linearly independent iff the same is true for the vectors 
(Qx1, —Qk2, —Be1, Geo), K = 1,2, in 4. In view of the conditions that the coef- 
ficients have to satisfy, four real parameters are left, as must be the case because 
the extensions correspond bijectively to 2 x 2 unitary matrices. Among these, the 
following two-parameter class is important. 


4.9.4 Example (separated boundary conditions:) Suppose that each of the func- 
tionals f;, contains quantities referring to one endpoint only, e.g., f1(¢) = ano¢(a)— 
ai29'(a) and f2(¢) = G216(b) — G22¢'(b). The assumptions of the previous corollary 
together with the fact that H(fi, fo) = H(yfi,6f2) for any nonzero 7,6 € © show 
that without loss of generality the coefficients can be chosen real and in the following 
normalized form, 


f(g) = $(@) cos§ — $'(a) sin€, fo(%) = o() cos + ¢'(b) sinn, 
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for a pair of parameters €,n € (0,7) ; the sign convention reflects the natural 
symmetry of the problem. Some properties of the operators Hig) := H(fi, fe) are 
collected in Problems 59 and 60. Among the conditions f,(¢) = 0 a distinguished 
role is played by those corresponding to values 0 and % of the parameter. The 
operator Ho, is said to satisfy the Dirichlet boundary condition, ¢(a) =0 at the 
endpoint a, while Hy, ,) satisfies the Neumann boundary condition, ¢'(a) = 0, and 


similarly for the other endpoint. 


Another situation in which Theorem 3 can be simplified occurs when @ has one 
singular endpoint, and there it corresponds to the limit—point case, i.e., the operator 
H has deficiency indices (1,1). Suppose for definiteness that the endpoint a is 
regular; then we have the following result. 


4.9.5 Corollary: For any € € [0,7), there is a self-adjoint extension H; of H 
with the domain 


Dz; := {¢@ED: g(a) cosé — ¢'(a) sinE = 0}, 

and on the other hand, to any self-adjoint extension H’ > H there is just one 
€ € (0,7) such that H’ = H} Dg. 

Proof: First we shall prove that [7, dy := limz—se_|n, dz = 0 holds for all ¢,n € D. 
We use the functions ¢o,¢, € D constructed in the proof of Proposition 4.8.6 which 
can be expressed as ¢, = %» +7, where 7, € D and 7 €G with n\(a) = Ors. 
Hence 7,7 are linearly independent and form a basis in G, so any @ € D can 
be written as @=wW+7%+%m with w~ € D. By Proposition 4.8.6, [n,%], = 0 
for any 7 € D, and since the ¢, are zero in the vicinity of 6, we have [n, mlb = 
[n, drlo = 0, 2e., [n,dlo = 0. The assertion now follows from Theorem 3. To 
any € € [0,7), we can choose a function w € D such that w'(a) = cosé and 
w(a) = sin€ which in view of the relation [w,¢], = 0 fulfils the assumptions of the 
theorem. On the other hand, for a given H’ there is a nonzero function w € D\ D 
such that w(a)w'(a) = w(a)w’(a) ; the domain of H’ then equals D’= {pe D: 
w'(a)b(a) — w(a)d’(a) = 0}. Hence it is sufficient to choose € = 0 for w(a) = 0 
and € = arctan(w’(a)/w(a)) otherwise. The injectivity of the map € ++ Hg is easily 
checked using Problem 54a. Wi 


Notice that there are alternative parametrizations of the above set of self-adjoint 
extensions; for instance, we can use c := cot€ setting H(c) := H}D(c), where 
D(c) :={¢eED: ¢'(a)—cd(a) = 0} for ce MR and D(co) = {dE D: d(a)=0} 
corresponding to the Dirichlet boundary condition. 


4.9.6 Example (the operator T on (0,00) ): By the corollary, the operator T of 
Example 4.8.5 has a one-parameter family of self-adjoint extensions whose domains 
are the subspaces in Dr = AC*(0,00) specified by the above boundary conditions 
at a = 0; we denote them as T(c). Let us determine their spectra. The equation 
—¢" = \o has a solution in AC?(0,00) only if A < 0; it is ¢(z) = ae~** where 
k := V—X. It follows that T(oo) as well as T(c) for c > 0 have no eigenvalues, 
while T(c) with c <0 has just one eigenvalue \ = —c* of unit multiplicity. 
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Next we shall show that o(T(0o)) = [0, 00) holds for the operator with Dirichlet 
boundary conditions. The spectrum is contained in JIR*+ because T(o0o) is positive; 
indeed, limz_.. 6(z)¢(z) = 0 due to Problem 12a, so (¢,T(0o)¢) = ||¢'||? > 0, 
and therefore inf o(T(0o)) > 0. To prove the opposite inclusion it is sufficient to 
choose a suitable sequence of unit vectors from D(T(0o)) , say 


dn: ¢n(t) = /é (e~2/" — e-#/™) elke 


for any k > 0; by astraightforward computation one shows that ||\¢%+k*¢,all? > 0, 
so Corollary 4.3.5a applies giving the sought result. 

This allows us to find the spectra of all the other extensions T(c). Each of them 
is self-adjoint, so its residual spectrum is empty, as well as o,es(T(0o)) . On the other 
hand, the deficiency indices of H are (1,1), 80 Oess(T(c)) = Gess(T(00)) = [0, 00) 
in view of Theorem 4.7.14. Combining these results, we get 


(0, 00) Scat wee 


o(T(c)) = 
{-c}U[0,00) ... c<0 


In conclusion, let us mention the explicit form of the resolvent for a regular 
£ with separated boundary conditions described in Example 4 (references to the 
singular case are given in the notes). 


4.9.7 Theorem: Let @ be a regular expression (4.8) on (a,b) and H the corre- 
sponding closed symmetric operator. Further, let Hj.) be a self-adjoint extension 
of H ; then 


(a) the resolvent of Hien) is for any A € p(Ajen)) a Hilbert-Schmidt integral 
operator with the kernel 


__ wr Wo(xr)ya(y) ae BS y 
ga(z, y) = Ww; Wa(x)yo(y) _" L< y 


where tq, are the solutions to é{y)] = Az) fulfilling the boundary conditions 
ta(a) = sin€, wi(a) = cos€ and y(b) = —sinn, W4(b) = cosy, and where 
Wap := W(a, Yo) is their Wronskian, 


(b) the spectrum of Hie») is pure point without accumulation points and simple, 
i.e., each eigenvalue has a unit multiplicity. 


Proof: The functions y, and y exist due to Theorem 4.8.1 and they are linearly 
independent, since otherwise they would belong to D(Hiem) and A would be an 
eigenvalue; hence W,, equals a nonzero constant on (a, b] due to Problem 54b. As 
elements of ac(a, | , the two functions are square integrable, and therefore the kernel 
ga belongs to L*((a,b) x (a,)). We denote the corresponding integral operator as 
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G); then by straightforward computation we check that (€—A)[G,¢] = ¢ holds for 
any ¢ € L(a,b), and that the function G)¢ satisfies the corresponding boundary 
conditions, i.e., that (Hien) — A)¢ = ¢ for all ¢ € L*(a,b) , which proves (a). 

In view of Problem 18a and the Hilbert-Schmidt property, G, is a normal 
compact operator; since it is injective, zero is its regular value. By Theorems 3.5.7 
and 3.5.9, there is now an orthonormal basis {¢;}92, C L?(a,b) and a sequence of 
nonzero numbers {j;}$2, such that Gy; = uj; and lim; 4; = 0. This implies 
; © D(Ajgn) and Hien = (he + )$;, 80 Hjgm)) has a pure point spectrum. 
The existence of an accumulation point would mean that a subsequence fulfilled 
ui, — pw — X,; however, that would contradict the condition yp; > 0. Finally, the 
spectrum is simple due to the fact that the solution of £[¢] = u;¢ is determined by 
the boundary conditions uniquely up to a multiplicative constant. {§f 


Notes to Chapter 4 


Section 4.1 As in the bounded-operator case, some classes of densely defined operators 
can be specified using their numerical range. If A is symmetric, e.g., we have O(A) C 
IR in view of Problem 3a. A symmetric operator is called below or above bounded if 
inf @(A) > —oco or sup O(A) < oo, respectively. In particular, A is said to be positive 
if O(A) C [0,00). Likewise, we define the dissipative and accretive operators as those 
whose numerical range is contained in the lower complex halfplane {z: Imz <0} and 
in {z: Rez >0}, respectively. More generally, an operator T is sectorial if O(T) is a 
subset of a sector {z: |arg(z—zo)| < 6} of the complex plane with an angle of 26 <7. 

The adjoint to a densely defined Banach-space operator T : % — Y is defined as 
follows: its domain D(T*) is the subspace of all g € )* for which there is an f € 4* 
such that g(Tx) = f(z) holds for all x € Dr, and T*g := f for these g. Hence JT" is 
a linear operator from Y* to +*. If 4, Y are reflexive and T is closable, then T* is 
densely defined and T** = T — cf. [Kal], Sec.III.5.5. 

Using the coherent states introduced in Sec.2.2, we can derive a useful integral repre- 
sentation for a class of unbounded operators on A?(€). Consider the set T consisting of 
operators which are densely defined and such that C C D(T)ND(T™*) ; then Theorem 2.2.7b 
yields. (Tf)(w) = (ew, Tf) = fo (T*ew)(2) f(z) e-l#’/2@z for any f € D(T), we €. Using 
the theorem once more, we get 


(Tf)(w) = ip (ew, Tez) f(zye""/"dz and (T"g)(w) = a (€z, Tew) gz) edz 


for any g € D(T*) ; in the second relation we have taken into account that C is total in 
A*(C) , so the adjoint is densely defined and belongs to T. In this way, we get a sort of 
matrix representation with respect to the “overcomplete basis” C. 


Section 4.2 The condition ||rn|| = 1 in the definition of the essential spectrum can be 
replaced by the requirement that sequence {r,} is bounded; we can easily check that 
the two definitions are equivalent. We have to stress that there are alternative definitions 
of Gess(T’) in the literature; however, all of them coincide when T is self-adjoint — cf. 
[Ka], Sec.X.1.2. The notion of the essential spectrum can also be introduced for closed 
operators on a Banach space — see [ Ka], Sec.IV.5.6. Theorem 7 implicitly contains the 
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following non-trivial statement: the subspaces D(T) and D(T*) are dense in # for any 
TEL(H). 


Section 4.8 If S is a symmetric perturbation to a self-adjoint A which is A-bounded 
with the relative bound one, then Theorem 12 is replaced by a weaker assertion known as 
the Wiist theorem: A+S is in this case essentially self-adjoint on any core of A. More 
about stability of self-adjointness with respect to symmetric perturbations can be found, 
e.g., in [Ka], Sec.V.4; [RS 2], Sec.X.2; | We], Sec.5.3. 


Section 4.4 Given a family of operators {Ta : a € I}, on Hilbert spaces Ha with the 
domains Da C Ha, we can construct the operator T := p> pla cn H := yt; He 
called the direct sum of the operator family {T.} in the following way: we set 


D:= {x €H: X(a)€ Da, > IlToX(a)|2 < x | 


and define (TX)(a) := T.X(a) for any X € D. It is easy to see that any subspace 
H®@) := {X : X(B) =0 for 6 #a} reduces T,, and if we denote T( :=T)H™ there 
is a natural unitary equivalence between the direct sum Es. 7 Zq and the orthogonal sum 
» I T‘@) . Some properties of direct sums are given in Problem 29. In a similar way, one 
can introduce the direct integral of an operator-valued function T: X — f¥ H(€) du(€) . 
The modifications are obvious: the sum in the definition in the domain is replaced by 
Sx IITOX Ol2dulE), ete. 

Proposition 6c shows that the unitary transformations T ++ UTU~! do not change 
o(T) ; we say that the spectrum is a unitary invariant. Other unitary invariants are the 
eigenvalues and their multiplicities. We could ask which are the quantities {pa} such that 
Pa(T) = pa(S) implies that the operators T,S are unitarily equivalent. If, e.g., T and S 
are normal operators with a pure point spectrum which have same eigenvalues including 
their multiplicities, then to any eigenvalue there is an isometric operator V), that maps 
Ker (S—A) to Ker(T'—A), and we can readily check that U := > = V) is unitary and 
T =USU~'. We shall see in the next chapter that in a sense this result extends to any 
normal operator. 


Section 4.6 A form is said to be sectorial if its numerical range is contained in a sector 
{zé€C : |arg(z —y)| < 6} of the complex plane for some y € ( and0<6< $. 
Most of the results discussed here have a straightforward generalization to sectorial forms 
— we refer, e.g., to [ Ka], Chap.VI, | RS 1], Sec.VIII.6. The Friedrichs extension is 
in a sense privileged among all self-adjoint extensions of a symmetric operator A ; by 
Theorem 11 it is the only one with the property that its operator domain is contained in 
D(s). Moreover, the Friedrichs extension has the smallest form domain of all self-adjoint 
extensions of A — see Problems 38, 39, and the notes to Section 4.9. Theorem 14 is 
named after T. Kato, J. Lions, P. Lax, A. Milgram and E. Nelson. 


Section 4.7 We sometimes want to know only that n4(A) = n_(A) holds for a given 
symmetric operator A, so its self-adjoint extensions exist. A sufficient condition is given 
in Corollary 2. Another way to check that the deficiency indices equal each other uses 
a conjugation operator commuting with A (see Problems 47, 48); this idea comes from 
J. von Neumann as does the most of the theory presented in this section. On the spaces 
L*(X,du) the complex conjugation, w 4 w, is usually used: the deficiency indices of a 
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symmetric A on L*(X,dy) are equal if x € D4 implies @ € Da and (Ay)(x) = (Ap)(x) 
holds for any ~ € Dg a.e. in X. Various physical applications of self-adjoint extensions 
will be discussed in Sections 14.6 and 15.4. 


Section 4.8 The proof of Theorem 1 is based on construction of a convergent iteration 
sequence; it can be found, e.g., in [| Nai 2], Sec.16.1 or [DS 2], Sec.XIII.1. We can 
extend to the equations considered here many results of the classical theory of differential 
equations (with smooth coefficients), e.g., the existence of a fundamental system of so- 
lutions to the homogeneous equation and the variation—of-constants method of solving a 
nonhomogeneous equation (Problem 54). In addition, most proofs of the classical theory 
— see, €.g., [Kam] — can also be used with appropriate modifications. 

In the general theory of ordinary symmetric differential operators one considers the 
so-called formally self-adjoint expressions 


it ath af. di 
fn: falfl = D(- (Pe a n=1,2,..., 


where po,..-,Pn are real-valued functions on an interval (a,b) ; in particular, for n = 1 
we get the Sturm—Liouville operator, which for pp = 1 yields the expression (4.8). If 
the functions p5', P1,---,Pn are locally integrable in (a,b), then Z, again determines 


a pair of operators H and H on L?(a,6), the first of which has the maximal domain 
consisting of all ¢ € L?(a,b) for which £,[¢] makes sense, and &,[¢] € L?(a,b). The 
operator H is uniquely determined by the requirement that it is symmetric, closed, and 
satisfies H* = H ; its deficiency indices are (m,m), where 0<m<vn. The origin of the 
limit—point/limit-circle terminology is explained, e.g., in [RS 2], notes to Sec.X.2. 
Proposition 9a actually holds under much weaker assumptions about the function V , 
€.g., it is sufficient that V(x) > —M(ax) holds for all x large enough, where M is a 
positive nondecreasing differentiable function on (a, 00) such that the function M’M~-3/2 
is bounded from above and [°° M~1/2(x) dx = oo ; the proof can be found in [ Nai 2], 
Sec.23, or with slightly modified assumptions in [ DS 2 ], Sec.VIII.6. An illustrative 
discussion of the halfline case is given in [RS 2], App. to Sec.X.1. From another point 
of view, Proposition 9 represents a particular case of the self—adjointness results we shall 


discuss in Section 14.1. 


Section 4.9 By Problem 40, the forms of Example 4.6.1c with V = 0 are associated with 
the self-adjoint extensions Tig.) of Example 4, if we set cg := cot€ and c, := —cotn, 
while the one with the coefficients ca = cp = 0 and the restricted domain { ¢ € AC{a, 6] : 
o(a) = ¢(b) = 0} corresponds to the Dirichlet operator Tjoo), which is the Friedrichs 
extension of T = P*. Similar conclusions can be made for the operator T on a halfline 
(Problem 62). These examples well illustrate some differences between the form and 
operator formalisms; while all the self-adjoint extensions Tig p) act as the same differential 
expression on different domains, the corresponding forms are different but have the same 
domain AC{a,b], with the exception of a single form whose domain is smaller and which 
is associated with the Friedrichs extension of the minimal operator T’. 

The explicit integral-operator representations for the resolvent in the singular case, 
as well as those for the regular case with nonseparated boundary conditions, are similar 
to that of Theorem 7; they can be found, e.g., in [ Nai 2], Sec.19; [DS 2], Sec.XIII.3; see 


also [ We], Sec.8.4. 
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Problems 
1. If is separable, then for any T € C(H) we can find a countable set {tn} C Dr 
which is dense in H. 


2. Prove: (a) the properties of an adjoint collected in Proposition 4.1.2. Show that the 
boundedness conditions in assertions (d),(e) are in general not necessary, 


(b) the conditions T € L(H), S € B(H) and TS € L(H) do not imply (TS)* = 


Sa”, 
(c) if T,S € L(H) and S has a bounded inverse, then TS belongs to £(H) and 
(GS) "SFT" 


3. Prove: (a) An operator A € L(H) is symmetric iff its numerical range O(A) is 
contained in JR. 


(b) A symmetric operator A is self-adjoint if RanA =H. 
(c) If A is self-adjoint and invertible, then A! is self-adjoint. 


(d) The sum of a self-adjoint and a Hermitean operator is self-adjoint. 


4. If A is symmetric then ||(A—A)z||? = ||(A — Re A)z|l? + [Im A}? ||x]]? holds for any 
AEC and xe Da. 


5. The domain of the operator Q on L?(JR) satisfies D(Q) c L'(IR). 
Hint: Apply the Holder inequality to x~!xy(x) on a suitable interval. 


6. Let T, be the operator from Example 4.1.4. Prove 


(a) D, #H provided the sequence {s;} is unbounded, 


(b) Ts4t D> Ts + Ty, where the inclusion turns to equality iff D, > Dsaz or 
Dt D Dstt , 


(c) Tse > T,T; , where the equality holds iff D, D Dy, 
(iT! een iff s; #0 for all 7 ; in that case we have T>! = T,-1, where 
otis {sj 1}. 


Hint: (a) If s is unbounded, there is an increasing sequence of positive integers 
{nj} such that |s,,| >. 
7. Prove: (a) Theorem 4.1.5 without reference to the closed-graph theorem. 
(b) A closed operator T on H with Dr =H is bounded. 


Hint: (a) Apply the uniform boundedness principle to { fy: y € H, |ly|| =1} CH", 
where fy(x) :=(y, Ax). (b) Show that T* is bounded and use Theorem 4.2.3. 


8. Prove: (a) If A is a symmetric operator then the following conditions are equivalent: 
(i) A* is symmetric, (ii) A* is self-adjoint, (iii) A is essentially self-adjoint. 


(b) If an operator A is e.s.a. the same is true for any symmetric extension A’ of 


it, and A’ = 4. 
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9. Let U be the operator on H@H given by U[z,y] := [—y, xz]. A linear operator T 
on H is densely defined if (UI'(T))+ is a graph; in that case [(T*) = (UI(T))+. 


10. Using operator V on H @®H defined by V[z, y] := [y, x] , prove 


(a) T is invertible iff VI'(T) is a graph; in that case, '(T-!) = VI(T), 
(b) if T is invertible and closed, T-! is also closed, 
(c) if T is invertible and T-! € L(H), then ['((T~)*) = VI(T"). 


11. Prove: (a) Let T be a linear operator on H. Its domain Dry can be equipped 
with the inner product (z,y)r := (x,y) + (Tz,Ty) ; then T is closed iff 
(Dr, (-,-)r) is a Hilbert space. 
(b) Using the previous result show that T is closed if there is a closed operator S 
such that Ds = Dr and ||Tz|| = ||Sz|| for all x € Dr. 


(c) If the operators T,S € C(H) have a comon core D and {|Tz|| = ||Sz|| holds 
for all e € D, then Dp = Dg and ||Tz|| = ||Sz|| is valid for any x € Dr. 


12. Using the notation of Example 4.2.5, prove 


(a) if @ € AC(a,oo) then limz.. 6(z) = 0 ; an analogous assertion is valid for 
p Ss AC(—o0o, b) ’ 

(b) AC(J) = AC(J) for any interval Jc R, 

(c) the operator P is unbounded; construct sequences {%,} C D(P) of unit 
vectors such that limp—co ||PWn|| = oo for J finite, semifinite, and infinite, 

(d) the operator P©®) on L?(0,1) defined as the restriction of P to D := 
{w € AC(0,1) : w(0) = 0} is closed but not symmetric; its adjoint acts as 
PO)* := iy! on D(P*) = {y € AC(0,1): Y(1) =0}. 


Hint: (a) Use the identity [7(6¢' + od’)(t) dt = |o(a)|? — |¢(a)|?. 


13. Given an interval J C IR denote AC?(J) := {py € L°(J): y! € AC(J)}. Prove 
that AC?(J) = D(P?) = D2, where D(P?) := {yp € AC(J): w! € AC(J)} and 
D2:={WeEL?: y,’ absolutely continuous in J, y” € L?(J)}. 


14. If T is a closed operator, then its regularity domain 7(T’) defined in Problem 1.64 
satisfies © \ 7(T) = o(T) \ r(T) = op(T) U Gesa(T) . 


15. (a) Let pa be a discrete measure concentrated on a countable subset {x;} C X. 
Then any function f : X —C is measurable and the corresponding operator 
Ty on L*(X,dya) is unitarily equivalent to T,,, where s¢ := {f(2;)}52y. 
Find the corresponding unitary operator. 
(b) The conclusions of Problem 6b-d extend to the Oe oat Ty of Example 4.3.3 
with the following modification: the inverse T7’ exists iff f(a) #0 p-a.e. 
in X ; in that case ta = T,, where g is defined by g(x) := f(x)~! for 
x € fC \ {0}) and arbitrarily in the remaining points. 
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22. 


23. 
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Let Ty be the operator from Example 4.3.3 on L?(X,du) with a finite measure, 
p(X) < oo. Prove 


(a) if the measure vy on X is defined by v(M) := fry e~F2)!? dua) , then f be- 
longs to L?(X,dv) for any p > 1, the operator V := T, with g(x) := elf(=)I*/2 
is unitary from L?(X,du) to L?(X,dv), and VT;V~! is the operator of mul- 
tiplication by f on L*(X, dv), 

(b) suppose that f € L?(X,du) for some p > 2 and set r := 2 ; then 
L"(X,du) C D(T;) and any D dense in L"(X,dy) is a core for Ty. 


Let yz be a Borel measure on JR? and f € L2,.(IR*, du) ; then C§°(R*) is a core 
for the operator Ty on L?(JR*, dy). 


Prove: (a) The resolvent of a normal operator T is normal for any p € p(T). 


(b) If A is self-adjoint and \ € o(A), then s-lim,0oR4(A+in) does not exist. 
Hint: (b) Since A is closed this would mean Ran(A—A) =H. 


Find the spectrum of any self-adjoint extension Pg of the operator P on L?(0,1) 
and show that it is pure point, while dess(P9) = Gess(P) = 9. 
Hint: To prove oes,(P) = @ use Example 1.7.6. 


Prove Corollary 4.3.10. 


Find an example of operators A,S such that S is A-bounded with the A-bound 
equal to one but for any b > 0 there is yy € Da such that ||Swzpl| > || Age|| +d|| well . 
Hint: Consider the operators of multiplication by z? and z*+ 2 on L*(IR), re- 
spectively. 


Suppose that A,S satisfy the assumptions of Theorem 4.3.12, and in addition, A 
is below bounded. Then A+S is also below bounded and its lower bound can be 
estimated by 


l-a’ 
where a,b are the numbers appearing in (4.3). 
Hint: ||SRa(u)|| <1 if w is less than the expression on the right side . 


MA+s 2 MA - max { alma| +o} ; 


Let {E;}§2, be a complete family of projections in H, i.e. E;Ex = 6j;.E; and 

jx1 #; = I (strong operator convergence). Further, let T be a closed linear 
operator which is reduced by all the projections E; ; its part in the subspace E;H is 
denoted as T;. Then x € Dr iff Ejx € Dr, j =1,2,..., and yer Tell" < 00; 
for any such vector x we have Tz = 521 T}z. The analogous assertion holds for 
any complete family {F.}. 


Prove: (a) Suppose that T is densely defined and E is a projection such that 
EDr C Dr and the subspace EH is T-invariant; then its complement 
(EH)+ is T*-invariant. In particular, if T is symmetric then the above 
conditions are sufficient for it to be reduced by E. 
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(b) If a projection E reduces a normal operator T then it also reduces T*, and 
T; :=T)EDy, Tz :=T}(I—E)Dr are normal operators on Hilbert spaces 
EH and (EH)+, respectively; moreover, we have o(T) = 0(T,) U o(T2). In 
particular, if T’ is self-adjoint (e.s.a., unitary), the same is true for operators 
T, and To. 


25. Prove: (a) Let T be an invertible operator on H, in general unbounded, which 
commutes with B € B(H) ; then T-! commutes with B. 


(b) Let {B,} C B(H) be a sequence of operators each of which commutes with a 
closed T and B, > B; then BTCTB. 


26. Let T,S be linear operators on H, B,C € B(H) and BIT CTB. Prove 


(a) if BS CSB, then B(T+S) C (T+S)B and BTS CTBS, 
(b) if CT CTC, then (B+C)T CT(B+C) and BCT CTBC, 
(c) if T is densely defined, then B*T* Cc T*B*, 

(d) if T is closable, then BT Cc TB. 


27. Let operators T and S be unitarily invariant, T = USU-!; then 


(a) the operators T and S have the same eigenvalues and their multiplicities, 
(b) if S is bounded the same is true for T and ||T|| = ||S|l, 


(c) if S is normal, then T is normal; in addition, if S has a pure point spectrum 
the same is true for T, 


(d) if S is closable, then T is also closable and T = USU—!, 


(e) if D is a core for S, then UD is a core for T ; in particular, T is e.s.a. on 
UD provided S is e.s.a. on D, 


(f) if a projection E reduces S, then UEU™! is a projection and reduces T’. 


28. Projections E,F which satisfy the condition ||E—F'|| <1 are unitarily equivalent. 
Hint: Use Proposition 3.2.11. 


29. Let T:= %¢;Ta be the direct sum of an operator family. Prove 


(a) if all the operators T, are bounded and supgey ||Talla < 00, then T is bounded 
and |{T|| :=supger ||Talle, 


(b) if all T, are densely defined, so is T and T* = 0®., Tz ; in particular, if 
all the operators T, are normal (symmetric, e.s.a., self-adjoint), then 7’ is 
respectively normal (symmetric, e.s.a., self-adjoint), 


(c) if all Ta are closed the same is true for T and T = 0%, To. 
80. Prove Theorem 4.5.2. 
31. Let B,,C, € B(H,), r=1,2. Prove 


(a) B, @ Bo =0 iff at least one of the operators Bj), Be is zero, 
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(b) if By @ By = C, @C2 #0, then C; =aB, and C2; = a~' By holds for some 
nonzero ae’. 


Prove the following properties of the bounded-operator tensor product: 


(a) B, @ Bo = B, Bo = BoB, where B, := B, @ Iq and Bo:=1;@ Bo, 

(b) |B, @ Ba| =|Bi| @| Bal, 

(c) if A, Ag are positive so is Ai @ Az and VA; @ Ap = VA1 @ VA2, 

(d) if A, > AL > 0, r=1,2, then A; @ Ap > A, @ AR > 0, 

(e) if E,, r= 1,2, is a projection, then Ran(£; ® £2) = Ran E; @ Ran E2. 
Suppose that E,, F, are nonzero projections on H,, r= 1,2, then the projections 
E:= E; @ Ey and F := F; @ Fy are also nonzero and the following analogue of 
Proposition 3.2.9 is valid: 

(a) E+F is a projection iff E,+F, or E2+F> is a projection, 

(b) E—F is a projection iff E,—F\, and E2—Fy are projections, 


(c) EF is a nonzero projection iff E\F, and E2F2 are nonzero projections; 
then we have (Ran £; ® Ran E2) 1 (Ran F; © Ran F2) = (Ran E; 1 Ran F)) @ 
(Ran Eo Ran Fp). 


The Fourier-Plancherel operator on L?(JR") can be expressed as F, = F®...2F. 


(a) Finish the proof of Proposition 4.5.6. 
(b) Find an example of operators for which (TS) ®(T2S2) # (T; ® Tz)(S; @ So). 


Hint: (b) Choose T; unbounded symmetric and Ker T2 = {0}. 
Given a tensor product Hi ® H2 with dim H2 < 00, prove 


(a) if 7; €C(H)) and T> is an invertible operator on H2, then T; © Tp» is closed, 
(b) if A, is self-adjoint the same is true for A; @ Ip. 


Let f: JR" — R bea measurable function with inf ess emn f(x) > —oo ; then the 
form s given by s(¢,W) := fin foyder with the domain D(s) := {@ € L?(IR*) : 
Sn \f||¢|?dx < 00} is densely defined, symmetric, below bounded, and closed. 


(a) Prove Proposition 4.6.2. 


(b) Let s be a symmetric below bounded form and a € @ ; then the conditions 
th 9, x and Yn te), y imply ata+yn £4, ar+y. 

(c) If s-,r = 1,2, are symmetric below bounded forms, the same is true for 
$ := 8, + 52. If s, are closed so is s ; if they are closable, s is also closable 
and 3C 3, +39. 


(d) Let s be a closed form, then a subspace D C D(s) is a core for it iff it is 
dense in H,. 
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39. Suppose that A is a symmetric below bounded operator, s is the closure of the 
form generated by A, and A, is the corresponding Friedrichs extension. Let A 
be a self-adjoint extension of A and denote by § the closure of the corresponding 
form; then D(&) > D(s). 

40. Prove: (a) Form t of Example 4.6.1c with V = 0 is closed and the operator A¢ 

associated with it is given by A:d := —¢” with D(A) := {@ € AC?(R) : 
$' (a) — cad(a) = 0, d/(b) + G(b) = 0} (compare with Example 4.9.4). 

(b) Let to be the restriction of t to D(to) := {¢ € AC|a, }] : $(a) = o(b) = 0}. 
Show that this form is closed and the associated self-adjoint operator is Ag, : 
And = —" with D(Ato) = {6 € AC7(IR) : $(a) = (6) = 0}. 

(c) Are these results preserved in the case of V #0 ? 


(d) Find an example of self-adjoint A,, Ag such that their form sum differs from 
Friedrichs extension of their operator sum, A;+A2 # (Ai1+A2)F. 


Hint: Use a relative-boundedness argument. As for (d), try A, := Ae, where 
AW j Al?) are the operators of part (a) corresponding to different pairs [ca, cg] . 


41. Let V be an isometric operator; then def (V—A) = dim(RanV)+ if |\| <1 and 
def (V—A) = dim(Dy)+ if |A| >1. 


42. If A is symmetric and B Hermitean, then n4(A) = nsi(A+B). 
Hint: Mimicking the proof of Theorem 4.7.1 show that t +> def(A+tB +7) is 
constant on RR. 


43. Let A be aclosed symmetric operator. Prove 


(a) the decomposition D(A*) = D(A) @Ker (A*+¢) @Ker (A*—¢) of Theorem 4.7.5 
is orthogonal with respect to the inner product (a, y),4 := (A*a, A*y) + (2,y) 
on D(A‘), 


(b) Im (a, A*x) = ||x_||? — |lz4|? holds for any x € D(A*). 


44. Prove: (a) A real number A is an eigenvalue of an operator A € Les(H) iff 


jie $33 is an eigenvalue of its Cayley transform C(A). 
(b) A self-adjoint operator A commutes with B € B(H) iff C(A)B = BC(A). 
(c) Let U: H +H, bea unitary operator; then UAU~! belongs to L..(H1) for 
any A € L,.(H) and UC(A)U-! = C(UAU™) ; similarly, UVU-! belongs 
to V(H;) for any V € V(H) and UC-1(V)U-! = C-(UVU-}).. 


45. Let S be the right-shift operator from Example 3.1.3. Show that the domain of 
2 
Ag := C~(S) is D(As) = {2 =p eies < een La é| < oo } and find the 
Fourier coefficients (e,, Asx) for any x € D(As) and k = 1,2,.... What are the 
deficiency indices of As? 


46. The Cayley transform of operator P on L?(0,00) from Example 4.2.5 is C(P) = 
S*} {yo}4+ where S* is the left-shift operator with respect to the orthonormal basis 
{Un} 29 C L?(0, 00), where pn(x) := V2e-*Ly(2x) — cf. Example 2.2.3. 
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47. Let A be aclosed symmetric operator with finite deficiency indices (n,n). Using 


48. 


49. 


50. 


51, 


52. 


O83. 


54. 


the notation of Theorem 4.7.10, prove 
(a) dim(I —V)G4 = dimG, <n, 
(b) dim(D(A’)/D(A)) <n; the relation turns to equality iff A’ is self-adjoint, 
(c) def (A—A) <n holds for any \ € IR which is not an eigenvalue of A. 

Hint: (c) A*} (D(A) @ Ker(A*—A)) is a symmetric extension of A. 


Let J be a conjugation operator on 4H, i.e., an antiunitary operator such that 
J? =I. Prove 


(a) the relations JE+ = (JL)+, JL = JL and dimJL = dimTL hold for any 
subspace L CH, 

(b) if subspaces L,L' c H fulfil JZ c L’ and JL’ C L, then JL = L’ and 
Why 


Using the results of the previous problem, prove the following von Neumann theorem: 
Let A be a symmetric operator. If there is a conjugation operator J such that 
JAC AJ, then n4(A) = n_(A). 


Let A be a closed symmetric operator, then 


(a) if A ¢ oes,(A) , then Ran(A—A) is a closed subspace, 


(b) if A € oess(A) is not an eigenvalue of infinite multiplicity, then Ran(A—A) is 
not closed. 


Let A be a closed symmetric operator with finite deficiency indices (n,n). If 
def (A—A) <n holds for some real number A, then the latter belongs to the spec- 
trum of any self-adjoint extension A’ of A. In addition, if A ¢ op(A) then it 
belongs to ¢5,(A’). 


Prove Theorem 4.7.15b. 
Hint: Use the operator U(zi,z2) := (Ao—22)(Ao—21)7! and the first resolvent 
identity. 


Prove Lagrange’s formula for the formal operator (4.8), i.e., the identity 
d — 
[ iia - Fetal) (x) ae = Uf, g1a— IF, ge 


with [f,g]z := (f9' — f'9)(x) for all f,g € ac(Jg) and any compact fevd}iaeJ; . 
Let £ be the formal operator (4.8) on J = (a,b). Prove 


(a) if the endpoint a is regular, then for any c € (a,b) and arbitrary 7,7, € C 
there is ¢ € D such that (a) = yo, O(a) =, and $(r) =0 on (c,d), 
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56. 
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(b) let g € Licc(Je) 3 if fi, fe are linearly independent solutions to the equation 
£[f] = g, then their Wronskian W := ffi — fife is a nonzero constant 
function on J, and to any cé Je the function 


f= WI POW) - AOAMMaW ay 


is the solution corresponding to boundary conditions f(c) = f’(c) =0. 


Let H be the closed symmetric operator corresponding to the expression (4.8); then 
w€D belongs to D= D(H) iff [¢,¥], —[¢,H]a =0 is valid for any ge D. If 
is regular, it is further equivalent to the conditions w")(a) = p)(b) = 0, r=0,1. 


Let A,;, r = 1,2, be a closed symmetric operator on a Hilbert space H,, and 
A:= A; @ Ag; then : : 

(a) ns(A) =ns(A1) +n(Ao), 

(b) Cess(A) = Jess(A1) U Oess( Ag) : 
Hint: (b) Use Proposition 4.7.13. 


(a) Perform the calculations in Example 4.9.1. 

(b) Using the same method, construct all self-adjoint extensions of T'. 

With the notation of Section 4.8, let D’ be a subspace such that DC D’/c D; 
then the operator H’ := H} D’ is self-adjoint iff the following conditions are valid 
simultaneously: 

(i) [¢, dle —[¢, vla =0 for any ¢,y € D’, 

(ii) if ¢ € D satisfies (6, v], — [¢,]a = 0 for all we D’, then de D’. 
Let 7; be the operator of Example 4.9.1 corresponding to the boundary conditions 


$(b) = ¢(0) and ¢/(b) = ¢’(0). Prove that the resolvent (T, — z)~! is an integral 
operator with the kernel 


2 ' . b b 
guaga= “kenge (2 k(rx< +6) sinkry — cosk (= > 3) cos k (u - 3) . 


where k:= ./z, re := min{z,y} and zy := max{z, y}, and show that the function 
z+ g(x,y) has poles only at the points of o(7T,). Find the resolvent kernels for 
the other extensions of Example 4.9.1. 

Hint: Use Theorem 4.9.7 and Krein’s formula. 


Let Hien) be the operator of Example 4.9.4. Prove 


(a) the map [€,7] ++ Hye.) is injective on [0,m) x [0,7), 


(b) if V(b+a—2x) = V(x) holds for a.a. x € (a,b), then He) and Hinge) are 
unitarily equivalent by U : (Uf)(x) = f(b+a—z). 


Let Tien) be the operator corresponding to the expression —d?/dx? on (a,b), 
1 := b—a < oo, with separated boundary conditions, and Tyg := Tye,¢. 
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(a) Find the spectrum of Tj¢ 9). 
(b) Find the spectrum of Tig _¢, . 
(c) With the exception of the lowest eigenvalue, the spectra of Tp := Tj) and 
Ty := Tix) coincide: we have AV =0 and AN = AP = ()?, n=1,2,..., 
(d) to any € € (0, F) U(§, 7), there is just one eigenvalue of Tig in each interval 
2 
Gay ; (Se) ) ,n=0,1,..., and it is a monotonic function of €. 


(e) How many negative eigenvalues has the operator Tig) and for which values of 
the parameter? 


62. Let T be the operator of Example 4.8.5 on L?(0,00). Show that its Friedrichs 
extension is the operator Too) with the Dirichlet boundary condition from Exam- 
ple 4.9.6. Find the forms associated with the other self-adjoint extensions T(c). 


Chapter 5 


Spectral theory 


5.1 Projection—valued measures 


The central point of this chapter is the spectral theorem which we shall prove in 
Section 5.3; before doing that we must generalize the standard integration theory 
to the case of measures whose values are projections in a given Hilbert space rather 
than real or complex numbers. Given a pair (X,A) where X is a set and A a 
o-field of its subsets, we define a projection—valued (or spectral) measure as 
amap E: A— B(H) which fulfils the following conditions: 


(pm1) E(M) is a projection for any Me A, 
(pm2) EUR*) =I, 


(pm3) E (Un Mn) = Xn E(M,) holds for any at most countable disjoint system 
{M,} C A; if it is infinite, then the right side is understood as the strong 
limit of the corresponding sequence of partial sums. 


The additivity requirement implies, in particular, E(@) = 0. If sets M,N € A 
are disjoint, then E(M UN) = E(M) + E(N) is a projection, and therefore 
E(M)E(N) = 0; this means that the projection E (U, M,) in (pm3) corresponds 
to the orthogonal sum @, Ran E(M,) of the corresponding subspaces. As in the 
case of numerical measure, we introduce the notions of an F-zero set, and of a 
proposition—valued function which is valid E-a.e. in X. 

In view of the intended application, we restrict our attention to projection— 
valued measures on JR? defined on the o-field A := B¢ of Borel sets. Let us first 
list some of their elementary properties (Problem 1). 


5.1.1 Proposition: (a) Let M, N be any Borel sets in IR¢. Then 
E(MNN) = E(M)E(N) = E(N)E(M), 
E(M UN) E(M) + E(N)- E(MNN); 


the inclusion M Cc N implies E(M) < E(N), and E(M) = E(N) holds iff 
E(MAN) =0. 
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(b) If {M,} is a nondecreasing and {N,} a nonincreasing sequence in B¢ , then 
E ( U Ms) =slimE(M,), FE (i Nn) = s-lim E(N,). 
m= noo ail NO 


(c) The map p, : B¢ — IR* defined to any z € H by p2(M) := (x, E(M)z) is 
a finite Borel measure, and similarly vry : Vey(M) := (2, E(M)y) is for any 
pair of vectors x,y € H acomplex measure with Revzy = j(May— Hey) and 
Im Vey = 5 (Maiy— Horiy) - 


5.1.2 Examples: (a) Suppose that {P;} € B(H) is an at most countable complete 
system of mutually orthogonal nonzero projections and A := {A;} Cc JR¢ 
has the same cardinality; then we define Ep(M) := 0; xm(A;)P; for any 
M e€ B?. It is clear that Ep fulfils the conditions (pm1) and (pm2). Let 
{Mn}, C Bt be a disjoint system and M := U72, Mn, then (x, Ep(M)z) = 
oj En XmMn (Ay) (x, Pjx) holds for any z € H. The double series has non- 
negative terms so it can be rearranged; this yields the identity (2, Ep(M)z) = 
Vr (2, Ep(Mn)x). Since the family {P;} is orthogonal! the same is true for 
{Ep(M,)} and the last relation implies Ep(Af) = >, Ep(.!M,) . where the 
series converges with respect to the strong operator topology. i.e.. the condition 
(pm3). The measure Ep(-) is said to be discrete with the support A. 


(b) Given M € B* we define E(M) as the operator of multiplication by the 
characteristic function of this set, E(f) := T,,, in the notation of Exam- 
ple 4.3.3. If M and N are disjoint, we have xazun = Xar + Xn. and therefore 
E(MUN) = E(M)+ E(N) ; the o~additivity can be checked easily using the 
dominated-convergence theorem. Finally, EUR?) = 1, so E is a projection— 
valued measure. 


In the particular case X = R it is useful also to introduce another class of 
maps whose values are projections on a given H. We say that a nondecreasing 
map t +> EF; is a spectral decomposition (or decomposition of unity) if it is right- 
continuous, Fy4o := s-lim;44F; = E,, and satisfies the conditions 


This definition makes sense due to the fact that the assertion of Theorem 3.2.12 
extends to one-parameter families of projections (Problem 2a). A spectral decom- 
position will often be denoted as {F;}. 


5.1.3 Proposition: Any projection valued measure E on R determines a spectral 
decomposition by FE, := E(—oo, t]. 

Proof: Monotonicity follows from the additivity of E. The relation (—oo, t] = 
M1 (—00,t + n7"} together with Proposition 1b gives E:49 = E;, and in the same 
way we check the relations (5.1). §§ 
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Moreover, a little later we shall see that this natural correspondence between 
the two classes of projection—-valued maps in fact holds in the other direction too. 


5.1.4 Example (spectral decompositions for dim < oo ): In this case any point 
u € RR has a right neighborhood (u,u+6) on which a given {F;} is constant, 
FE; = E,. Suppose that this is not true; then there is a decreasing sequence {tn} 
converging to u such that E,, # E, for all n, and the right continuity of {E;} 
implies s-limp.o/;,, = E,. We may assume without loss of generality that all Fi, 
are mutually different, but then {Bi “A Ey,.)H} is an infinite family of orthogonal 
subspaces, and this contradicts the assumption dimH < oo. The same argument 
yields the existence of a left neighborhood (u—6,u) on which FE; = Ey_o, and real 
numbers a < @ such that FE, =0 and Eg =; in particular, any point at which 
t ++ E; is continuous has a neighborhood where the map is constant. 

This allows us to describe the general form of a spectral decomposition on a 
finite-dimensional 71. We denote u, := sup{t: FE; =0}. It is clear that F,,-9 = 
0 and u; cannot be a continuity point, so Ey, #0. In the case E,, #1 we set 
ug :=sup{t > u,: E; = Ey, }, etc. ; the process must end at last at the N-th step, 
where NV := dim#. Concluding the argument, we can say that for a given {F;} 
there are real numbers u; < ug <--- < u, and a complete system of orthogonal 
projections fh where 1 <n < dim#H, such that Fy = Dy X(-c0,q (Uy) Pj 
holds for any t€ AR. 


Next we shall show how projection—valued measures can be constructed starting 
from a suitable projection—valued set function; this is a direct extension of the 
standard procedure for numerical-valued measures — see Appendix A.2. Suppose 
we have a function FE which associates a projection E(J) with any bounded interval 
J C RR? and satisfies the following conditions: 


(i) E(-) is additive, 
(ii) E(J)E(K) =0 if JNK =9, 


(iti) the function fiz := (2, E(-)x) is regular for each x € H and the relation 
sup{ jiz(J): J € J4} = ||z||? holds for any interval J € 7%. 


The condition (ii) implies, in particular, the identity E(JM K) = E(J)E(K) = 
E(K)E(J) for any J,K € J*. 

By Theorem A.2.4, there is a unique Borel measure pz, on JR? such that 
ba(J) = jig(J) holds for any J € J4. The condition (iii) then gives 


Hor >= la|? He , [x (JR*) = |x|? ; 


to check the last relation, it is sufficient to realize that there is a sequence {J,} of 
bounded intervals such that pz(M,) > ||z|/? —n7!. Given {u.}, we can associate 
with any pair x,y € H the signed measures Ozy := }(Hriy—Me-y) and Try 2= 
i(Me+iy—Mx-iy) , and the complex measure vzy defined by Vey = Ory — iTzy. In 
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particular, if 2 = y we get Ver = Orz = Hz from the first of the above relations, 
and the polarization formula implies 


Vey(J) = (2, E(J)y), JE T*. 


To demonstrate that the set function F can be extended to a projection—valued 
measure, we first prove some auxiliary assertions. 


5.1.5 Lemma: Let M,N be Borel sets in R¢ ; then 


(a) there is a positive operator E(M) € B(H) such that vzy(M) = (zr, E(M)y) 
holds for any z,y € H, 


(b) an operator B € B(H) commutes with E(M) if it commutes with E(J) for 
alleen T “a 


(c) E(M)E(N) = E(N)E(M) = E(MON). 


Proof: To prove (a), it is sufficient to check that the form [z,y] +> fm(z,y) := 
Vey(M) is sesquilinear, bounded, and positive. We have Vzay42(J) = QUry(J) + 
Vez(J) for any x,y,z € H, a€ © and J € J*; then by Proposition A.4.2 the 
same relation holds for all M € B4, which means that the form fy, is linear in the 
right argument. In the same way we get Vzy(M) = vyz(.\) , so it is also sesquilinear 
and symmetric. The relation fy(x,xr) = ur(M) shows it is positive, and finally 
the Schwartz inequality yields |fru(x,y)|? < we(M)uy(M) < ueUR*)uyUR%) < 
\|z||?\|yl?. The condition BE(J) = E(J)B implies vg-z,4(J) = vr,By(J) for all 
x,y €H and J € J‘; using Proposition A.4.2 again we get the same identity for 
all M € B?, so (b) follows from the already proven assertion (a). 

Let u’) be the measure generated by pi, and the function yy. so p%)(M) := 
Uz(M NN) = (2, E(MON)z), where the last relation follows from assertion (a). 
Hence p\/)(K) = (x, E(J)E(K)x) = py(K) holds for any pair J,K € 74, where 
we have denoted y := E(J)x. By Theorem A.2.4, pl/)(M) = jy(M) for any 
M ¢€ B* and since the vector x was arbitrary, assertion (a) gives E(MN J) = 
E(J)E(M)E(J). The operators E(M), E(J) commute by assertion (b) so the right 
side can be rewritten as E(M)E(J), i.e., we get wO')(J) = (a, E(J)E(N)ax). The 


commutativity of E(J),E(M) implies E(J) = /E(N)E(J)\/E(N) by Proposi- 


tion 3.2.6, so pl “ my — ) with z:= ,/ BN )e ; 4 the above argument, 


we arrive at pi zr, E(N)E(M),/E(N)z) = M)E(N)z) , which 


concludes the “2 


5.1.6 Theorem: Any set function E on 7% which fulfils conditions (i)—(iii) extends 

uniquely to a projection-valued measure E on JR*. Furthermore, if an operator 

B € B(H) commutes with E(J) for any J € 77, then BE(M) = E(M)B holds 

for all Borel M Cc IR?. 

Proof: Assertion (a) of the lemma tells us that E(-) is an extension of FE to the 
o-field B¢ and E(UR*) = I, and putting M = N in (c) we find that E(M) is a 
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projection. The o—additivity of E follows from the o—additivity of the measure pz 
and Theorem 3.2.12, and the uniqueness is a consequence of Theorem A.2.4. The 
last assertion is implied by assertion (b) of the lemma. ff 


Now we want to prove the above mentioned converse to Proposition 3. 


5.1.7 Corollary: There is a bijective correspondence between the spectral decom- 
positions {£;} on H and the projection—valued measures E’: IR > B(H). 


Proof: A spectral decomposition {E;} determines a set function E on J which 
fulfils conditions (i) and (ii) — see Problem 3b — and the corresponding function 
fiz is regular due to the right continuity of t+ E,. Indeed, in view of Problem 3a to 
any € > 0 we can find a neighborhood (a,a+6) in which (x, yx) < (x, Eyx)+$e, 
so (x, Fa4s-ox) < (x, Fax) +e. By the same argument, there is 7 > 0 such that 
(2, Ey_yv) > (x, Eyp_ox)—€, and therefore fiz(Jr) > fiz(J)—2e holds for J := (a,b) 
and Jr := [a+ 6,6 — 7] ; similarly we check the regularity for the other types of 
intervals. Finally, the condition sup{ jiz(J): J € J4} = ||z||? is implied by (5.1), 
so the result follows from Theorem 6 in combination with Proposition 3. 


5.1.8 Example: The spectral decomposition from Example 4 is associated with 
the discrete projection valued—-measure Ep : Ep(M) = 41 Xm(u;)P; supported 
by the set {u;: 7=1,...,n} — cf. Example 2a. 


Let us mention another application of Theorem 6. Suppose we have projection— 
valued measures F on JR’ and F on R?°, both with values in B(H), which 
commute mutually, 7.e., E(M)F(N) = F(N)E(M) for any M € B" and Né€ BS’. 
By Lemma 5b, this property is equivalent to E(J)F(K) = F(K)E(J) for all Je 
J’, K € J*. A projection—valued measure P on RR'*® is called the direct product 
of EF and F if P(MxN) = E(M)F(N) holds for any M € B" and N € B*. 


5.1.9 Proposition: Under the stated assumptions, there is a unique direct product 
P: BYt* —, B(H) of E and F. 

Proof: Any nonempty interval J € J7"*+’ can be expressed uniquely as J=JIxK 
with J © J’ and K € J°*, and by the commutativity assumption E(J)F(K) 
is a projection, so P defined by P(J) = E(J)F(K) for J #0 and P(0) = 0 
maps 7’+* to the set of projections on H. Using the Cartesian product properties, 
we easily check that P fulfils conditions (i) and (ii). Consider now the function 
jie := (x, P(-)x) and a pair of intervals Jag = Jax Ka € J’**, a = 1,2. We shall 
use the identity 


jte(Si) — fie(J2) = (2, (E(A1)— E(J2)|F (Ki) x) + (a, E(J2)[F(K1)— F(K2)]2) ; 


if J, SU J; D J, 80 aig —E(Jo) isa te i and the first term on the 
right side equals ||F(K1)[E(J1)— E(J2)Jal|? < uJ) — pb (Ja). The analogous 
estimate for the second term gives 


jte(J1) — fir(Jo) < wi) — uw (Jo) + wi) — pKa) ; 
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hence the regularity of ji, follows from the same property of ph) wand pf?) . oA 
similar argument with J, replaced by JR™*® shows that sup{ p(J): Jef*}= 
\|z||2, so P extends to a projection—valued measure on JR"**. 

It remains for us to prove the relation P(MxN) = E(M)F (N ) forany Me BY 
and N € B®; recall that the left side makes sense due to Example A.1.5. Using 
the identity MxN = (Mx JR’) UR"x N) and Proposition la, we see that it is 
sufficient to check the relations P(MxJR%) = E(M) and PUR’xN) = F(N). The 
first of these can be rewritten as pz(M x JR’) = u\/)(M) for all 2 € H, where the 
numerical measures juz, S") correspond to P and E, respectively. We know this 
holds for intervals, and by o-additivity, for unions of any at most countable disjoint 
system {J,} C J”, in particular, for any open set G C JR”. The measure p®) is 
regular so there is a nonincreasing sequence of open sets {G‘)} such that each of 
them contains M and 


pw (M) = lim w(G) = p(M,), 


where M, := 1%, G‘). Then {G‘) x IR*} is a nondecreasing sequence of sets 
containing Mx Re and we get p2z(M x JR’) = limp uw”) (G®) = p(M), so 
p)(M) = 0 implies u.(M x JR’) = 0. In the same way, we obtain the relation 
[r(M, x IR?) = p)(M) ; combining these two results, we arrive at 


u)(M) = p2(M;z x IR*) = pr((Mz \ M)x IR*) + w2(M x R’) = p2(Mx R’). 


This is the desired result; similarly we can check the identity PUR’x N) = F(N) 
forany NEB’. § 


5.2 Functional calculus 


Let us turn now to integration with respect to projection-valued measures. We shall 
proceed in two steps; first we restrict our attention to bounded functions. Let E 
be a projection—valued measure on JR?. The set of functions y : IR? + © which 
are E-a.e. defined and Borel will be denoted as L®(IR?¢,dE) ; as usual, we identify 
the functions which differ at most on an E-zero set. As in the case of numerical- 
valued measures, we can demonstrate that L°(IR¢,dE) is a Banach space with the 
norm ||: ||, where ||Y~l|oo is defined as the smallest c > 0 such that |y(zx)| <c 
holds for E-a.a. x € IR® (alternatively we can use the essential range of \qo|. — wef. 
Problem 5). 

The integral will now be a bounded linear mapping of the space L°(JR*, dE) 
into B(H). We shall define it first on the set of simple Borel functions o : IR¢ > € 
which we denote as Sq. If o = j;a;xm,, where {M;} C B? is a finite disjoint 
system with U; M; = JR‘, we set 


To) = f ,o(t)dE(t) = ajE(M), (5.2) 
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where ¢ in the integration variable stands for (#,...,ta). The function o can have 
different representations unless all the numbers a; are mutually different; however, 
this does not affect the right side in view of the additivity of the measure EF. We 
shall also write "Ft (y) if the dependence on the projection—valued measure should 
be stressed; the integration domain and variables will be often dropped. 


5.2.1 Proposition: The map 7: Sg — B(H) is linear and ||%(c)|| = |lolleo-. 
Moreover, the relations 7,(@) = %(o)* and T(or) = Ty(o) T(r) = Th(7)T(o) hold 
for any 0,7 € Sq. 


Proof is left to the reader (Problem 6a). 


_ The last property is called multiplicativity. Notice that when equipped with the 
pointwise multiplication, (yw)(x) := y(x)W(x), the space L°(IR*,dE) becomes 
a commutative algebra, and Sy is a subalgebra in it; using the terminology of 
Appendix B, we can state the assertion concisely as follows: J is an isometric 
*-morphism of the subalgebra Sz C L°(R?,dE) into the Banach algebra B(H). 

In the next step we extend the mapping J, to the whole space L©(JR?,dE). 
This is possible in view of Theorem 1.5.5, because J is continuous and Sy is dense 
in L~(UR*,dE) by Proposition A.2.2; hence we may define 


Ty) = [ep aB(e) = wlim K(on), (5.3) 


where {o,} C Sq is any sequence such that ||p—on||. 0. 

The “matrix elements” of this operator, (x, 7%(y~)y) = limpoo(x, Te(on)y) , can 
be expressed in an integral form; using | f y dus — f on duz| < ||~—-cnlloo||x||? , we get 
(x, %(y)x) = f pdp,, and by polarization this result extends to the nondiagonal 
case, 


(=, [ eazy) = [v(t)doey(t) (5.4) 


for any x,y € H. Roughly speaking, inner product and integration can be inter- 
changed when matrix elements of f pdE for a bounded y are computed. 


5.2.2 Theorem (functional calculus for bounded functions): The map y+ T,(y) 
defined by (5.3) has the following properties: 


(a) it is linear and multiplicative, 
(b) |Z(¥)Il = Ilvlloo and T%(%) = T(y)* for any y ¢ L°UIR*, dE), 


(c) the operator J(y) is normal for any py € L®*(IR’,dE) and o(7,(y)) = 
ACE 


(d) if {y,} C L*(IR4,dE) is a sequence such that y(t) := limpeo Yn(t) exists 
for B-a.a. t € IR* and the set { |lynllo: 2 =1,2,...} is bounded, then the 
function y belongs to L°(R*,dE) and Tj(y) = s-limn0oT(Yn) - 
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Proof: Linearity is due to Proposition 1 and Theorem 1.5.5. Let y,w € L®(IR*, dE) 
be uniformly approximated by sequences {on}, {tz} C Sg. Since the multiplication 
in L®°(IR*,dE) is continuous, we have limp. ||~~—nTrlloo = 0, and therefore 
Ty( pw) = u-limp—+scoTs(OnTn) , 80 Proposition 1 together with Problem 3.1 yields the 
multiplicativity. The definition relation (5.3) gives ||Zo(y)|| = limn—eo ||To(on)|| = 
limp—soo ||onlloo = |l¥lloo, and similarly 7(%) = T(y)* follows from Proposition 1 
and the continuity of the map B+ B* in the operator-norm topology. 

Assertions (a) and (b) further imply %(y)"Ze(y) = Tall¢l2) = Tole) B(Y)" 5 0 
the operator J(y) is normal. In a similar way, 


I(Z(y)—Azl? = (@, Blip AP)2) = fh belt) APP) ; 


by Corollary 3.4.4 anumber belongs to o(%(p)) if the infimum of this expression 
over all unit vectors x is zero. Suppose that \ € R&)(y) and denote M.., := 
yiY)((A—e,A+e)). The projection E(M.,) is by assumption nonzero for any 
€>0; at the same time it equals %(ym,.,) , 80 we get 


I(Ts(~)—Aal? = fh ket) —AP xa, .(O) da(t) < ©? 


for any unit vector z € Ran FE(M,)), i.e., 4 € o(Te(y). Conversely, let A ¢ 
R)(p) ; then M. is an E-zero set for some ¢ > 0 and pr(JR* \ Mey) = 
fe (IR*) = |\x||? holds for all z € H. This implies ||(%(~)—A)al|? > ¢*llx\/?, so 
A ¢ o(T,(y) , thus proving assertion (c). Using Problem 1.10 we readily find that 
y belongs to L®(IR*,dE). We have 


M(Z(~)-Bva)al? = fhe —ealOPdjue(t), 


and since ||~n|lo. are bounded and juz is a finite measure, assertion (d) follows from 
the dominated-convergence theorem. § 


Some simple consequences of this theorem are collected in Problems 6-8. 


5.2.3 Example: If Ep is a discrete projection—valued measure on R with a finite 
support {Aj,...,Aw}, then the operators f y(t)dE(t) have a simple form. Any 
function y € L©(dR?,dE) is in this case E-a.e. equal to the simple function with 
the values y(A;) at t = A; and zero otherwise; hence T)(y) = Pad p(A;)P; , where 
P; = Ep({A;}). 

Moreover, for any y € L*(IR,dE) we can find a polynomial Q, of a degree 
< N-1 such that 7,(y~) = 7%,(Q,,). Indeed, by Problem 6c this identity is equivalent 
to the conditions Q(A;) = yj, j = 1,...,.N , which yield a system of linear equations 
for the coefficients of Q,. Its determinant is Mie Tai (A;— Ak), so it has just 
one solution, provided the numbers A; are mutually different. Using linearity and 
multiplicativity, we can further rewrite this result as follows, 


N-1 
[eo dE(t) = Xe Ab yo [tazott), 
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where the numbers ao,...,av-1 solve the system S7~2o anAk = y(Aj), J = 1,...,N. 
In particular, this result holds for any projection—valued measure with values in a 
finite-dimensional 1. 


5.2.4 Example: Consider the projection—valued measure P on JR? which is de- 
termined by a pair of commuting spectral decompositions {F;} and {Fi} (ef. 
Problem 3c); the corresponding projection—valued measures on JR will be denoted 
as E& and F,, respectively. To a given Borel y : IR — C€ we define the function 
yg: : R? = € by pi(t,u) = y(t) ; in the notation of Appendix A.1 we can write 
this as y; = yxe. We shall show that fyidP= fypdE. 

Since yi) (K) = y'-)(K) x JR holds for any K C @, the function y is 
Borel, and by definition of the measure P, the conditions y € L~®(UR,dE) and 
gy, € L©(UR?, dP) are equivalent, and ||¥|loo = ||¥illoo. If y € L© UR, dE) , there is 
a sequence {o,,} of simple functions, ||on—y||.. + 0; then the functions o,xe are 
also simple, f(onxe)dP = fondE, and |lonxe—Y1|lo. > 0, so we get the sought 
relation, 


)dPG,u) = i t) dE(t). 
J. 9) aPtt,u) = fi ole) ant) 
The same is true for the integration over the other variable (see also Problem 9). 


The following proposition shows how the functional—calculus rules combine with 
the Bochner integral of Appendix A.5. For simplicity, we present it with rather 
strong assumptions which are, however, sufficient for our future purposes. 


5.2.5 Proposition: Let the function ~: IRxJ — (, where J C R is any interval, 
fulfil the following conditions: 


(i) ~(-,u) is a bounded Borel function for any u € J, 
(ii) w(t,-) belongs to C(J) for any te R, 


(iii) the functions y(-) := f;¥(-,u) du and n(-) := fry(t,-) dé are Borel and 
bounded, 


(iv) the operator—valued function u++ B(u) := 7,((-,u)) is continuous on J in 
the operator—-norm topology, and if J is noncompact the limits at its endpoints 
exist; moreover, ||B(-)|| € L(J). 


Then B(-) € B(J; B(H)) and 
i B(uydu = f ( I b(t, u) 4E() du = ie ( if w(t,u) du) dB(t) = Ty). 


Proof: The vector-valued function B(-) is integrable due to Proposition A.5.1. We 
denote C := f, B(u) du; then Proposition A.5.2, together with (5.4), yields 


(as) — if (0, B(u)x) du = ( ihe Y(t.) dus(t) ) de 
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for any x € H. Furthermore, 7 € C(R,duz) by assumption (iii), so ~ is integrable 
on IRxJ with respect to the product of , and the Lebesgue measure, and 


(v,Cz) = f (f-vituydu) dua(t) = ff v(t) dua(t) = (@, To(y)2) 


by the Fubini theorem; since both operators are bounded, we get C=T(y). 


Let us now pass to unbounded functions. Given a projection-valued measure 
E on Re? with values in B(H), we denote by @g(R®) the set of all E-a.e. defined 
complex Borel functions. If the equality of two elements and the algebraic operations 
have the same meaning as in the bounded case, @g(IR*) becomes a commutative 
algebra containing L°(IR?,dE) as a subalgebra. Our aim is to extend J to a 
mapping which associates with any function ~ € ®-(JR%) an operator, in general 
unbounded, with the domain 


D, = {2 EH: i: \o(t)|*dup(t) < oo } 


5.2.6 Proposition: D, is dense in H for any ¢ € ®g(JR®). 

Proof: We have ||E(M)(ax+y)||? < 2la|?ue(M) + 2uy(.\f) for any M € Bé and 
all z,y€H,a€C€,so Dy isa subspace. The sets M, := {t € IR?: |y(t)| <n}, 
are Borel and form a nondecreasing system such that E (U%, Mn) = EUR*) = 1 
because the y is by assumption F-a.e. defined. Then ||/E(\f,)x—2|| - 0 follows 
from Proposition 5.1.1b, so it is sufficient to check that 2, := E(\M,)x belongs 
to D, for n = 1,2,.... We hawe: pee(N) = ECON jell? = BN Oe pall = 
fv XM,@lx for any N € B4; hence Proposition A.3.4 gives 


fra POP due () = fle P rat dealt) <n? 


and therefore z,€ D,. Ui 


To construct the operator f y(t)dE(t) for a given p € ®g(IR*), we take a 
sequence {Yn} C L*(IR*,dE) such that yp(t) converges to y(t) and |y,(t)| < 
lp(t)| holds E-a.e. in IR? ; these requirements are fulfilled, e.g., for y° :-= px, , 
where the M,, are the sets introduced in the above proof. Let z € D, ; then 


%s(Yn)z — To(Ym)x\|? < 2 i len (t) — p(t)|?dme(t) + 2 | pe WPmrlt) — V(t) Pde (t) 
so the sequence {7;(yn)x} is Cauchy by the dominated-convergence theorem. Thus 
it has a limit in 7, and a similar argument shows that it is independent of the choice 


of sequence {yp}, i.e., that the relation 


T(y)z = lim R(y,)z, xe Dy, 
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determines a densely defined operator on H which is linear because Tj(yn) are 
linear; we call it the integral of p with respect to the measure E and denote it as 


[a9 4B) = Ty). (5.5) 


Since ||%(yn)z||? = f |~n|*due and the right side has a limit due to the dominated- 
convergence theorem, the definition implies 


IT@)al? = fle) Pdnelt) (5.6) 


- shows that D, is the natural choice for the domain of T(y). In particular, 
= Hoifeame L>UR4, dE) and T(y) = T,(y) as we check, choosing y, = y for 
i n, i.e., the mapping 7 represents an extension to J (see also Problem 10a). 
Next we shall derive functional—calculus rules for unbounded functions. Domain 
considerations make the proofs more difficult in comparison with the bounded case, 
as it already illustrates the following assertion, which extends the identity (5.4). 


5.2.7 Proposition: Let y € D, for a given y € Og(IR*) and x €H;; then y is 
integrable with respect to the complex measure Vzy := (x, E(-)y) and 


(,T(p)u) = fe) dvey(t). 


Moreover, Vz,7(y)y(M) = Sy v(t) dvey(t) holds for any M € Be. 

Proof: The integrability is equivalent to the condition f |y(t)|d|vzy|(t) < oo , where 
|Vey{ is the total variation of vzy. Let {Nz} C B* be a disjoint decomposition of a 
Borel set M. By the Schwarz inequality, |vry(Nz)| < (ue(Ne)My(Ne))!/? , so using 
the definition of |v,,| from Appendix A.4 we get the estimate 


\veyl(M) < (He (M) py(M))"/? . 


It follows from the monotone—convergence theorem that there is a nondecreasing 
sequence of non-negative simple functions on := 32; CnyXm,; Which converges to y 
everywhere in IR4, fulfils on(t) < |y(t)|, and limpioo fon d|vey| = f |p| dlvey|. At 
the same time 1; c2jMy(Mn;) < S \p(t)|? duy(t) ; then Holder inequality together 
with the above estimate imply 


Jee (| dnl (2) = itn, 3 ensy| (Mn) 


n—0o 


1/2 / 
<p, (UR*) jim, (2p < pe (IR") Ch jotta) ) 


where the right side is finite provided y € D,. To prove the first identity, we take 
a sequence {y,} C L®(IR*,dE) which satisfies the requirements of the definition, 
and use the relation (5.4), 


(,T(y)y) = lim, (@, Ts(Yn)y) = Jim, [, va(t) dvey(t) = felt) duny(t) 
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where in the final step we have employed the dominated—-convergence theorem. This 
implies in turn Vz,7(y)y(M) = (2, E(M)T(¢)y) = S e(t) dvecayzy(t) , so the remain- 
ing assertion follows from Problem 8. 


5.2.8 Example: Consider the projection-valued measure BE) : E')(M) = 

from Example 5.1.2b on L?(JR?, dg) , where @ is a given Borel measure; we shall on 
that T(f) := f f(t)dE(t) coincides for any Borel function f : J IR? — € with 
the operator T; of Example 4.3.3. We have (yp, E®(M)p) = Ju y(t)w(t) do(t) for 
any y,w~ € L?(JR*¢,do) and M € B?, so the condition J |f(t)|?|W(t)|?de(t) < co is 
equivalent to f |f (t) Paso (t) < co by Proposition A.3.4, which means that T‘)(f) 
and T; have the same oo Using the above proposition, we get 


(TAY) = [ fOaL, = [ FOPOVO dolt) = (eT) 


for any ~ € D(T;) and y € L*(IR?, do), i.e., T)(f)p =T;w for all w € D(T;). 


This example suggests why the general functional-calculus rules listed below, 
which generalize Theorem 2, are closely analogous to the properties of the operators 
T; from Example 4.3.3, and the proofs are also almost identical. 


5.2.9 Theorem (functional calculus - the general case): Let E’ be a measure on 
IR? with values in B(H) ; then the mapping T : ®g(JR*?) — L(H) defined above 
has the following properties: 


(a) homogeneity, T(ay) = aT(y) for any ae, 


(b) T(p+w) D> T(y)+T(e) , with the inclusion turning into identity iff at least 
one of the relations D,4y C Dy, or Dy4y C Dy is valid, 


(c) T(yw) D T(y)T(w), where the domain of the right side is Dy M Dy ; an 
identity is valid iff Dpy C Dy, 


(d) injectivity, T(y) = T(w) implies y(t) = W(t) for E-a.a. t € IR, 
(e) T(y)*=T(P), 


(f) T(y) is invertible iff the set Kery := y'-"({0}) is E-zero, in which case 
T(y)"? = T(¢""). 


Proof: The homogeneity is obvious. If the conditions x € D, and x € Dy hold 
simultaneously, we also have f |p(t)+(t)|?dz(t) < oo, i.e, 2 © Dysy ; then 
(y, T(e+e)x) = S(y(t) + ¥(@)) dvzy(t) = (y, (T(p) +T(¥))x) holds for any y € H 
by Proposition 7. The remaining part of (b) follows from the relations D, A Dy = 
Dory~N Dy and DgNDy = Dey Dy which are easy consequences of the inclusion 
io” N Dy = Doss j 

A vector x belongs to the domain D of the product T(y)T(w) iff the con- 
ditions f |w(t)|?du2(t) < oo and f |p(t)|*duriy)2(t) < oo are valid simultaneously; 
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the second of these can be rewritten as f |\p(t)|?|¢(t)|?due(t) < oo (Problem 11a), 
i.e, x € Doy. Together we get D = Dy Dyy, and the two sets are equal if 
Dey C Dy. By Proposition 7, 


(WT(~)TW)z) = fel) dryrelt) = felt) dyelt) = (vs T(e¥)2)- 


holds for all 2 € D and y € H, which proves (c). 

Let M,, be the subset of IR? on which both y, # are defined; to prove (d), 
it is sufficient to show that the set N := {t € My: p(t) ¥ Y¥(t)} is E-zero. 
This can be expressed as the union of the nondecreasing sequence of its subsets 
Nn = {t © Myy : |p(t)—¥)| = 0, |y(0)| <n}, s0 E(N) = slim E(Nn) 
amd the relation HUN) = 0 holds if zy := E(N,)z = 0 for all n and z € H. 
Clearly zt, C Dy for all n, and since D, = Dy by assumption, assertions (a) and 
(b) give T(y—~w)z, = 0. Now the relation (5.6) implies 


0 = IT(v—Wanl? = f° |o- ole) Pduelt) = 0-78 Q)ar 


fomeny on, t.c., E(N,)xz = 0. 

The adjoint T(p)* exists because D, = Dg is dense in H ; using Theorem 2b 
we easily find that T(%) C T(y)*. It remains for us to prove that if (y,T(y)x) = 
(z,x) holds for some y,z € # andall x € D,, then y € Dy. Weset yr := Ti(m)y, 
where mm := yxm, and M, are the sets from the proof of Proposition 6; then 
Yn € Dg since J |p(t)|2djty,(t) = Sr4q pCt)I*dpiy(t) < 00. Next we set 2 = Yn ; 
then the Schwarz inequality gives |(y,7(~)yn)| < |lzll llyn||. The left side can be 
rewritten by Proposition 7 as 


(v,T(p)m) = fmm ® aey(t) = IIvall?. 


Thus we get f |nn(t)|?duy(t) < |/z||?, n = 1,2,..., which means that y € D, by the 
Fatou lemma. 

Finally, if E(Kery) = 0 then the relation #(t) := y(t)~) for t € IR? \ Kery 
defines a function ~ € @g(IR4%). Since y(t)p(t) = 1 holds FE -a.e., assertion (c) 
implies D(T(#)T(y)) = D, and T(w)T(y)x = x for all x € Dy, and the same 
relations with the roles of y,w interchanged; hence the operator T(y)~} exists 
and T(y)-! = T(wW). To check the opposite implication, we employ the iden- 
tity YXKery = 0, which yields by (c) the relation T(p)E(Kery) = 0 from which 
E(Ker y) = 0 follows if T(y) is invertible. J 


5.2.10 Example: Let Qy be a polynomial of N-th order, Qy(z) := Dp Qn2z”, 
with complex coefficients such that an #0. If y € L©(IR4,dE), the assertions 
(a)—(c) of the theorem yield 


N N 
T(Qvop) = T bs an" = Lantos 
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with a little more effort one can show that the same is true for any y € ®g(JR*). 
Indeed, let x € D(y”) (we use the alternative notation D(y) := D, whenever it 
is convenient), then z € D(y™) for m <n by the Hélder inequality because the 
measure jl; is finite, i.e., D(p") C D(p™) if n > m, and therefore 


T(p") = T(p)” 


follows by induction from Theorem 9c. Furthermore, due to the assumption there 
is a positive c such that 4 < |Qy(z)ay'z~™| < 3 holds for all |z| > c, so Prob- 
lem 10c gives D((Qnov)xm-) = D(y’ xme), where we have dencted M*° := {t € 
R* : |y(t)| > c} ; using the fact that both functions are bounded on IR* \ M¢ 
we conclude that D(Qnoy) = D(y%). Theorem 9b now implies T(Qyoy) = 
ayT(y)% + T(Qmoy), where Qm is a polynomial of a degree < N —1 ; the 
desired formula is then obtained by induction. 

If f € ®gUR*) is a real-valued function and the polynomial coefficients are 
real, this result can be combined with Theorem 9e showing that the operator 
2 (nT (f)" is self-adjoint for any real co,c1,...,¢n ; in particular, any power of 
T(f) is self-adjoint. 


Theorem 9 implies other important properties of the operators T(y). 


5.2.11 Theorem: (a) T(y) is normal for any y € ®g(JR*,dE) and the spectrum 
o(T(p)) = Ri2(¢). 


(b) A complex number 2 is an eigenvalue of T(y) if E(y'-)({A})) 4 0, in 
which case the corresponding eigenspace is Nz(,)(A) = Ran E(y'-)({A})). 


(c) -T(y) is self-adjoint iff y(t) € IR for E-aa. t € R?. 


(d) If B € B(H) commutes with the measure EF, then BT(y) C T(y)B for any 
function y € ®g(JR?). 


Proof: We have T(y~) = T(%)*, so T(y) is closed. Furthermore, D, = Dz 
and (5.6) implies ||7(y)z|| = ||T(y)*z|| for all x € D, ; hence T(y) is normal by 
Theorem 4.3.1. The remaining part of (a) is obtained in the same way as Theorem 2c. 
To prove (b), we denote My := y'-)({A}). If a nonzero z fulfils E(M))r = x, 
then fipa |(p(t)/?due(t) = Sy, |p(t)|?dur(t) = |Al?||r]]?, so 2 € D, and the relation 
(5.6) together with Theorem 9b gives 


I(T(e)—Aal? =f Ielt)—AlPdua(t) = 0, 


i.e, A€o(T(y)) and Nziw(A) D Ran E(M,). On the other hand, let x € Dy, be 
a unit vector such that T(y)z = Ax. This implies f |y(t)—Al|*du,(t) = 0, which 
means that p,(IR* \ My) = 0, and therefore (My) = ||z||?, i.e., E(My)x = z. 
Assertion (c) follows from (d) and (e) of Theorem 9. Finally, the commutativity 
of operator B with E(M) for any M € B* implies vy52 = vgey2, and in a 
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similar way, we get ua2(M) < ||Bl|?u2(M). The last relation shows that D, is 
B-invariant; then 


(y, T(y)Ba) = | v(t) dy,2e(t) = (B'y, T(y)z) = (y, BT (p)e) 


holds for any x€ D, andallycH. 


In conclusion, let us mention how composite functions are integrated with re- 
spect to a projection—valued measure (see also Problem 15). 


5.2.12 Proposition: Suppose that EF is a projection—valued measure on JR? and 
w ; IR? — JR” is such that its “component” functions w; : R¢ > IR, j =1,...,n, 
are Borel. Define Ey: Ey(M) = E(w'-(M)) ; then the composite function yow 
belongs to ®g(JR*%) for any y € Sz, (JR") ; and 


ie y(t) dE, (t) = I (vow) (t) dB(t) 


Proof: Let yp be defined on IR" \ N , where E,,(N) =0; then w := pow is defined 
on w'-)(JIR"\ N) , whose complement is an E-zero set; using the properties of Borel 
sets, we can check that w'-)(M) = w'-))(y'-)(M)) belongs to B? for M € B, so 
w is Borel, i.e., ~ € ®g(IR*). Theorem A.3.10 easily implies Di as De, and 
the relation (x, T‘™“)(p)x) = (x, T)(yow)zx) for all « € DY) , which extends by 
polarization to any pair z,y € Jo eat , and since this domain is dense in 1, we get 
T~)(~) c T')(yow) ; however, the two operators are normal, and therefore equal 
each other (cf. Remark 4.3.2). I 


5.3 The spectral theorem 


The main goal of this section is to prove the spectral theorem for self-adjoint 
operators, which in a sense represents a cornerstone of the theory. Let us first 
formulate it. 


5.3.1 Theorem: (a) To any self-adjoint A, there is just one projection—valued 
measure EF, such that 


A® A tdE,(t). (5.7) 


(b) A bounded operator B commutes with A iff it commutes with EMA) = 
Ea(—oo,t] forall te R. 


The formula (5.7) is usually called the spectral decomposition of the operator A. 
The proof will proceed in several steps. We start by checking the assertion for 
Hermitean operators; from them we pass to bounded normal ones, in particular, to 
unitary operators, and in the last step we shall employ the Cayley transformation. 
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Therefore let A be a Hermitean operator on H ; we denote the interval between 
its lower and upper bound by J4 := [ma,Ma]. To any polynomial P with real 
coefficients, P(t) :-= Lj=o cjt?, we can define the Hermitean operator P(A) := 
Djx0 7A’ , where a= 1. We can easily check that the map P ++ P(A) is linear 
and redwlientviere, and also monotonic, which means that P(A) > 0 if P(t) > 0 on 
J, (Problem 16b). Our aim is extend this to a wider class of real-valued functions 
on J, in such a way that the mentioned properties will be preserved. 

Consider first the family Ky of functions f : J4 — [0,00) such that there is a 
nondecreasing sequence {fn} C C(J4) which converges pointwise to f ; it is obvious 
that f is closed with respect to sums, products, and multiplication by positive 
numbers. Instead of continuous functions, the elements of Ko can be approximated 
from above by polynomials. 


5.3.2 Proposition: If f € Ko and {f,} C C(Ja) is a sequence with the stated 
properties, then there is a sequence {P,} of polynomials such that f,(t) < F,(t) 
and P,(t) <P.is(t) for any te J, aie w= 1,2,..., aoa, B= Fa). 
If {Q,} is another sequence of polynomials with these properties, then for any n 
we can find m, so that Qm(t) < P,(t)+n7! and Pn(t) < Qn(t) +n7! holds for 
any t€ J, andall m>my. 


Proof: By the Weierstrass theorem, the function gn := fn +3.2-"-* can be approx- 
imated by a polynomial P, so that |g,(t)—P,(t)| < 2-"-? holds for any t € Ja, 
a.e., 2°"! < P,(t)—f,(t) < 27" ; this proves the first assertion. In the remaining 
part the polynomials appear symmetrically, so it is enough to prove one of the in- 
equalities. Since Qn(t)—Pm(t) - 0, for any n and t € J4 we can find a positive 
integer m(n,t) such that Qm(t)—Pn(t) <n7! for all m > max{m(n.t),n}. Since 
{P(t)} is nonincreasing, we have Qm(t) < P,(t)+n7! ; the inequality then follows 
from Problem 17. § 


The monotonicity condition here is important, because due to it the sequence 
{P,(A)} is nonincreasing, and since it consists of positive operators, it converges to 
a positive P := s-limn.P,(A) (ef. Remark 3.2.5). Successively performing the 
limits m — oo and n — oo in the inequalities Qm(A) < P,a(A)+n7! and Pp(A) < 
Qn(A) +n7! which follow from the proposition, we see that the limiting operator 
is independent of the choice of the sequence {F,} approximating the function f ; 
in this way, we get the map Ty : Taf = s-limpioo.P,(A) from Ko to the set of 
positive operators on H. It has the following simple properties, the proof of which 
is left to the reader (Problem 16a). 


5.3.3 Proposition: The map Ty is additive, multiplicative, and Ty (cf) = cT4(f) 
for any f € Ko and c > 0. If functions f,g € Ko satisfy f(t) > g(t) for all 
t € Ja, then T4(f) > Ta(g). 


The map T, does not extend P + P(A) because the polynomials assuming 
negative values on J, are not contained in Ko , but there is a common extension to 
these two maps. We introduce the set K consisting of the functions y : J4 > R 
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which can be expressed as y = y,—y_ forsome ys € Ko. We can easily check that 
K is a real algebra with pointwise multiplication, which contains all real functions 
continuous on Jy ; in particular, a restriction to J4 of any real polynomial belongs 
to K. Given a function y € K, we ascribe to it the Hermitean operator 


Ta(y) = Ta(p+) — Ta(y-) ; 


the definition makes sense, because if » = ¢, — @_ is valid at the same time, then 
the additivity of Ta implies T4(ys) + Ta(G_) = Ta(G4) + Ta(e_), 80 Taly) = 
Ta(G1) — Ta(-). 


5.3.4 Proposition: The map T' is linear, multiplicative, and monotonic. For any 
y € K, there is a sequence {P,} of real polynomials such that y(t) = limp—eo Pa(t) 
for any t € J, and Ta(yp) = slimp..P,(A). 

Proof: The additivity is obvious; to prove the linearity, we have to decompose ay 
into the difference of cay, and cap_., where € := sgna, and in a similar way 
we can check the multiplicativity. If » < ~ on Jag, then yi tv. <a +y_, 
which implies T4(y) < T4(w#) by Proposition 3. Finally, if {P‘*)} are sequences 
approximating the functions yi, we set P, := P{+) — P'-) ; it is easy to check that 


The map T, extends both T, and P++ P(A) ; to check the last assertion it 
is sufficient to realize that any polynomial can be written as a sum of a polynomial 
positive on J, and a constant. 


5.3.5 Proposition: For any u € R, define ey := X(-coujnJa \ Ja ; then the map 
ut E,, := Ta(eu) is a spectral decomposition on H. 

Proof: Each of the operators E,, is positive and E? = E, follows from the multi- 
plicativity of T,,so0 E,, is a projection; using further the monotonicity of T,, we 
get Ey, < E, for u<v. The relations (5.1) are satisfied trivially; hence it remains to 
check that u+-> E,, is right continuous at any u € [m,4,M,). Consider the continu- 
ous functions gn := max{0, min{1, 1—n(n—1)(.—u—n7!)}} for n > (Mg—u)7!4+1 
which fulfil the inequalities gnii(t) < eutim(t) < gn(t) for each t € Ja, and 
limp—eo 9n(t) = eu(t). Let PF, be polynomials approximating g, according to 
Proposition 2; then Ey, = s-limpooPn(A) and P,(A) > Eutijn > Eu; however, the 
operators Fusijn— Ey are projections, so (%,(Pa—Exy)x) > ||(Lusin—E all? for 
all 2 € H, and therefore s-limpsoo Putin = Eu. 


With these prerequisites, we are able to finish the first step of the proof. 


5.3.6 Proposition: The spectral theorem holds for any Hermitean A, and in this 
case the measure FE, is supported by Jy. 

Proof: Let E4 be the projection—valued measure generated by {T4(eu)} . Obviously 
E,(—oo,m,) = Ea(Ma,oo) = 0 and the function id : id(z) = x belongs to 
L®(IR,dE,4) ; we shall show that 7j")(id) = A. Let D be any division m4 = 
jo <a... <edge—4, ofethe inteeval: J,4. We denote Lp := [to,ti] and 
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Les (deer te], K = 2,...,25 them yx, |} Ja = eg aid yxy WJa="e, eae, “for 
k = 2,...,n, so the characteristic functions yz,, k = 1,...,n belong to K in 
view of Problem 16f and T4(x1,) = Ea(Le). Furthermore, the simple function 
sp := Wh, texz, also belongs to K and Ty(sp) = Dh teTa(xn,) = TK” (sv) 
by (5.2). Given € > 0, we can choose D such that maxicp<n(te—te-1) < € ; then 
7") (id)-T}"” (sp) || < € by Theorem 5.2.2b, and at the same time, ||T4(id }Ja)— 
Ta(sp)|| < € by Problem 16d. Since T4(id}J4) = A due to Problem 16e, the last 
two inequalities give ||7;{"*)(id) — All < 2e, and therefore Te) (id) = A. 

Suppase that F is another spectral measure corresponding to the operator A, 
then 7, m (f) = Ta(f Ja) holds by Problem 18a for any real function f whose 
restriction to Ja belongs to Ko. In particular, choosing f := X(-co,u) for u > ma 
we get F(—o0, u] = E4(—oo,u]. The same is obviously true for u < ma, so the 
two measures are identical, F = E,4, in view of Problem 3d. 

The condition BE“ = EM B,t € R, implies by Problem 3c BE,4(M) = 
E,(M)B for any Borel set M ,so BA = AB follows from the functional—calculus 
rules. To prove the opposite implication, we employ Proposition 4. Toany te R, 
there is a sequence {P)} of polynomials such that E{”’ = s-limpeo P(A) ; the 
condition BA = AB then implies BP‘ (A) = P\(A)B and the limit n — oo 
concludes the proof. § 


Now let B be a bounded normal operator, B = A; + iA2. Its real and 
imaginary part commute, A,;A2 = A2A;, so the identity E4,(M\)E4,(Mfo) = 
Ea,(Me2)Ea,(Mi1), Mi, Me € B, is valid for the corresponding projection—valued 
measures. We denote their direct product as Fg and use the natural isometry 
which allows us to identify IR? and ©. Since Aj, A are bounded, the function 
idc : ide(z) = z belongs to L®(€, dF), and the relation 


ib zdFp(z) = Ar +iAg = B (5.8) 


follows from Problem 9. This yields another modification of the spectral theorem. 
5.3.7 Proposition: Let B be a bounded normal operator; then 


(a) there is a projection—valued measure F'g on € supported by o(B) for which 
the formula (5.8) is valid. A projection—-valued measure F on C€ such that 
Tide) = B and F(M, x Mg) = E4,(M,)Ea,(M2) for all My,M> € B 
coincides with Fz, 


(b) anumber  € €' belongs to the spectrum of B iff Fs(U.) #0 holds for any 
e-neighborhood of it; furthermore, 4 is an eigenvalue iff Fg({\}) #0, and 
in that case the corresponding eigenspace is Ng(\) = Ran Fp({A}). 


Proof: We have already checked part (a) with the exception of the support claim; 
assertion (b) follows from Theorem 5.2.11. The resolvent set p(B) is open and 
due to (a) any point of it has a neighborhood U such that Fg(U) = 0. Hence 
(x, Fg(p(B))x) = 0 holds for all x € H (see Appendix A.2),so Fa(p(B))=0. & 
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In the next step we pass to unitary operators. We know that a unitary operator 
U is normal and its spectrum is a subset of the unit circle C:={zeC: |z|=1}, 
which means that Fy(€ \ C) =0. The circle can be parametrized by Re z = cost 
and Imz = sint with ¢ € (0,27) ; using this we define the function w: C > R 
which maps €\C to zero and w(z) = ¢t at the circle points. It is easy to check that 
w is Borel; in view of Problem 15a, this determines a projection—valued measure on 
IR by Ey(M) = Fy(w-)(M)) for all M € B. It is obvious that Ey(—oco,0) = 
Ey(2r, 00) = 0, and Propositions 7a and 5.2.12 imply 


U = ie e dEy(t) ; (5.9) 


hence we get the spectral theorem for unitary operators (see also Problem 20). 


5.3.8 Proposition: For any unitary operator U, there is a projection—valued 
measure Ey on R such that (5.9) is valid. Moreover, the spectral decomposition is 
unique, 2.e., if E is a projection—valued measure fulfilling E(—oo,0) = E[27, 00) = 
0 and U= fre“ dE(t), then E = Ey. 

Proof: It remains for us to check the uniqueness of the spectral measure which is 
equivalent to the relation f yo,q(u)dE(u) = f xjoq(u)dEu(u) for all t € (0,27). 
If there is a measure E with the stated properties, then Theorem 5.2.2 implies 
fT(u) dE(u) = {T(u)dEy(u) for any trigonometric polynomial, and furthermore, 
fwv(u) dE(u) = fu) dEy(u) for any function ~: IR — € which is continuous 
and 27—periodic due to Problem 6a and the Fejér theorem. In particular, we can 
choose for y~ the piecewise linear function which is 27—periodic and is equal to 1 
and 0 in the intervals [0,t] and [t+1/n,21r—1/n], respectively; taking the limit 
n — co and using Problem 6a once more we get the sought identity. [J 


5.3.9 Example (spectral decomposition of the FP-operator): Consider the Fourier- 
Plancherel operator F,, on L?(IR"). If n =1, it has a pure point spectrum due to 
Problem 2.6, o,(F’) = {1,—7,—1,i}, and by induction we can check that the same 
is true for any n. The corresponding spectral measure is then 


3 
Ep, : Ep,(M) = Yo xw-0ca(e™”) Pr, 
k=0 


where the projections P, := Ep, ({%*}) refer to the eigenvalues e™*/2 and w is the 


function used above; it is discrete and supported by {0, %,7, 2}. By Problems 20b 


and 22, the eigenprojections may be expressed as Py = a = OF, and using the 


argument of nese 5.2.3, we get a system of equations a ae westieais which 
is solved readily by cl!) = i(- —i)* ; this yields the formula 


ice k TC aLage 
Bry(M) = 7 xm (S) "RE 
j,k=0 


for any Borel set Mc R. 
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In the final step, we combine Proposition 8 with the Cayley transform of a 
self-adjoint operator A. We start with an auxiliary result. 


5.3.10 Proposition: Suppose that A is self-adjoint and EF is a projection—valued 
measure on JR such that the relation (5.7) is valid. Then E(w'-))(-)), where 
w(t) := 7 +2arctant, is the spectral measure of the unitary operator C(A). 
Proof: Since w is continuous, F := E(w‘-!)(-)) is a projection—-valued measure 
due to Problem 15a, and moreover, F(—oo,0] = F[21,0co) = 0. The function 
n: n(s) = e% belongs to L*(IR,dF) and e* = (w7!(s)—i)(w7*(s)+7)~!. Using 
Proposition 5.2.12, we get 


is ja 
fe dig vie i t+i ge); 
t-i 


it remains to prove that the right side, which we can write as T(p) with y(t) = = 
equals C(A). The operator T(y) is unitary by Problem 6c. Since A is self- 
adjoint, for any y € H there is z € Da such that y = (A+i)z = T(id + 1)z, and 
T(y)T(id +17) = T(id — i) due to Theorem 5.2.9c because the two operators have 
the same domain Dg. This yields (x, T(y)y) = (x, (A—i)z) = (x, (A—i)(A+i)~!y) 
for any x€H,i.e., T(y)=C(A). FE 


Now we are ready for the proof of the spectral theorem: Let U := C(A) be the 
Cayley transform of A and F the spectral measure of U. Due to Theorem 4.7.7b, 
A = 1 is not an eigenvalue of U, so F({0}) = 0 by Problem 21, and therefore 
F(—oo, 0] = F[27,00) = 0. Consider the function v: JR — R defined by v(s) := 
tan(*5") for s € (0,27) and v(s) := 0 otherwise; it is easy to see that it is Borel and 
v(s) = tan(*5*) F-a.e. in JR. Using the functional-calculus rules in combination 
with (5.9) and the inverse Cayley transformation, A = ([+U)(I—U)7}, we get the 
relation A = i T‘P)(14y)TP)((I-n)7!) c TWP) (i it2) , where we have denoted again 


n(s) := e. This further implies A c T*)(v) , because i = tan(*5*) provided 
s is not an integer multiple of 27 ; however, both operators are self-adjoint, and 
therefore equal each other. The relation (5.7) now follows from Proposition 5.2.12 
if we define Ey := F(v'-)(-)). 

The uniqueness of the spectral decomposition is a consequence of Propositions 8 
and 10. The condition BA C AB implies BU=UB by Problem 4.44b, and since 
E{” = F(0,w(t)| = F(—00, w(t)] we get ES) B = BE due to Problem 20c. The 
opposite implication is checked in the same way as in Proposition 5.3.6. [ff 


Part (b) of Theorem 1 allows us to extend the notion of commutativity to any 
pair A, A’ of self-adjoint operators, in general unbounded: we say that they com- 
mute if the corresponding spectral measures EF and E’ commute. By Problem 3c, 
this is further equivalent to the condition E,E) = ElE, for all t,s € IR, which is 
due to the spectral theorem valid iff A commutes with {E/} and A’ commutes 
with {£,}. Taking Proposition 6 into account, we see that if one of the operators, 
say A, is bounded the commutativity is equivalent to AA’ C A’A, i.e., the new 
definition is consistent with the old one (see also Problem 23b). 
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Theorem 1 has a purely existential character; it tells us nothing about how the 
spectral measure can be found for a particular self-adjoint operator A. To solve this 
problem, we either have to guess the spectral measure and prove afterwards that 
it satisfies the relation (5.7), or find it using another self-adjoint operator whose 
spectral decomposition is known. 


5.3.11 Examples: (a) Let A be a self-adjoint operator on a separable H with a 
pure point spectrum, o,(A) = {A;: 7 = 1,2...}, where the eigenvalues A, 
correspond to a complete system of eigenprojections P;. Let EF: E(M) = 
; XM(A;)P; be the discrete projection—valued measure from Example 5.1.2a; 
we shall show that T“)(id) = A. The set IR \ op(A) is f--zero for any 
x €H; hence the domain of T‘”)(id) consists of the vectors which satisfy 


ff duslt) =X % (@, Pix) < 00, 
4] 


and (r,T)(id)x) = 30; A; (az, Pjxz) holds for any Djg in view of Proposi- 
tion 5.2.7. The projections P; reduce the operator A by Example 4.4.3, 
P;A Cc AP;, so ||Ax||? = Xj A? ||Pjxl? < co holds for any « € Da, ie., 
Da C Dia. In a similar way, we check the inclusion A C T)(id) , and since 
the operators are self-adjoint, they equal each other. The action of the ope- 
rator A given by the right side of (5.7) can in this case be expressed explicitly 
as Ar = )0; Pj/Ax = 0; APjx = 0, A;Pjx for any xe Da. 


(b) Let E be a projection-valued measure on JR* and w : IR? > R a Borel 
function which is defined E-a.e. The operator fpw(t)d#(t) is self-adjoint 
and E,, := E(w‘-)(-)) is its spectral measure due to Proposition 5.2.12. 


(c) Suppose that A is self-adjoint and U is unitary, then A’ := UAU~1is also self- 
adjoint and the spectral measures of A, A’ are related by E’(-) = UE(-)U—} 
in view of Problems 4 and 12. 


(d) Similarly, let A be reduced by a projection P ; then its part A’ := A) PD, 
is again self-adjoint and its spectral measure is given by E’(-) := E(-)) PH 
due to Problems 3e and 13. 


5.4 Spectra of self-adjoint operators 


The existence of the spectral decomposition for a self-adjoint operator A provides 
a tool for characterizing and classifying the spectrum. To begin with, let us collect 
some results which follow immediately from Theorem 5.2.11. 


5.4.1 Proposition: Let A be a self-adjoint operator and a real number; then 


(a) AG o(A) iff Ea(A—e,A+e) £0 for any e>0, 
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(b) A €o(A) iff Ea({A}) £0, and Na(A) = Ran (E4({A})) is the correspond- 
ing eigenspace. Any isolated point of the spectrum is an eigenvalue, 


(c) the set o(A) is nonempty and IR\o(A) is E,4-zero; the operator A is bounded 
iff its spectrum is bounded. 


5.4.2 Remarks: (a) The fact that o(A) is nonempty represents an important 
result of the spectral theory; recall that in distinction to the bounded case 
covered by Theorem 1.7.5, an unbounded closed operator can have an empty 
spectrum — cf. Examples 1.7.6. Assertion (c) also allows us to write the 
formula (5.7) as 


fa ii ay (ABAD 


this shows that the numerical range of A satisfies the relations infO(A) = 
inf 0(A) and sup 0(A) = supo(A). Moreover, the spectrum is the minimal 
closed set with an E,4-zero complement, i.e., the spectral measure EF, is 
supported by o(A). 


(b) The spectral measure of a single point can be written using the spectral decom- 
position {EB } as Ea({A}) = i a .. then it follows from assertion (b) 
that o,(A) consists just of the discontinuity points of the map t > EM). 

Furthermore, the residual spectrum of a self-adjoint operator is empty, so (a) 

in combination with the decomposition (1.6) shows that o,(A) consists of the 


points in which tt EM is nonconstant but continuous. 


The essential spectrum of a self-adjoint operator A is related to the dimension- 
ality of its spectral measure. 


5.4.3 Theorem: (a) A real number 2 belongs to the essential spectrum of A iff 
dim Ran E4(A—e€, A+¢) = 00 holds for any ¢ > 0. 


(b) oess(A) is a closed set. 


Proof: Let dim Ran E4(A—e€,A+¢€) = oo for any € > 0, and consider the se- 
quence of neighborhoods U, := (A—1/n,A+1/n). Proposition 5.1.1b implies 
S-liMpooH4(Un) = Ea({A}) ; if there is an 7m such that E4(U,) = E4(Ua) for 
all n > 7m, then dim E4({\}) = dim E4(U;) = 00, so 2 is an eigenvalue of in- 
finite multiplicity. In the opposite case we can choose a subsequence {U;,,} such 
that E4(Un,,,) # Ea(Un,) for k = 1,2,...; since the sets A, := Uses, \ Cea Bite 
disjoint, the projections Ey := E4(A,) form an orthogonal family. We pick a unit 
vector x, in each Ran Ey; then ||(A—A)zell? = fy, |t—-Al?dun, (t) < ng?. In view 
of the orthonormality, {a,} has no convergent subsequence, so again \ € Ces) 

Assume on the contrary that the condition is not valid; then \ is not an 
eigenvalue of infinite multiplicity; in view of Proposition 4.7.13 we have to show 
that A is a regular value of the operator A, := A)(Na(A))+. By assumption, 
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dim Ran E4(U-(A)) < oo for some € > 0 ; hence there is a positive 6 < € 
such that B4(A—6,A) = E4(A,A+6) = 0 (cf. Example 5.1.4), and therefore 
(Na(A))+ = Ran(I—E,4(Us(X))). The operator A) is self-adjoint due to Prob- 
lem 4.24b, so A € p(A,) iff there is a c > 0 such that ||(A,—A)z||? > ella|l? 
for all x € D(A)), ie., all z € Dg fulfilling the condition F{4(Us(A)) = 0 ; the 
functional-calculus rules show that the inequality is valid with c= 6. This proves 
assertion (a), which in turn easily implies (b). 


Another useful criterion combines a modification of the definition from Sec- 
tion 4.2 with topological properties of the essential spectrum. 


5.4.4 Theorem: Let A € RR; then the following conditions are equivalent: 
(a) A € Gese(A) , 


(b) there is a sequence {x,} C Da of unit vectors which converges weakly to zero 
and limp—+oo ||(A—A)znl| = 0, 


(c) X is an accumulation point of o(A) or an eigenvalue of infinite multiplicity. 


Proof: In the previous proof we have shown that (b) follows from (a); recall that any 
orthonormal sequence converges weakly to zero. On the other hand, a sequence {z,} 
which satisfies condition (b) contains no convergent subsequence, since xp, — 0 
contradicts the requirement ||z,, || = 1 for all k. It remains for us to show that (a) 
and (c) are equivalent. If condition (c) is not valid, then either A ¢ o(A) or it is 
an isolated point of the spectrum with dim Ran E4({A}) < 00, so » ¢ Gess(A) due 
to Theorem 3. On the contrary,  ¢ Gess(A) implies dim Ran F4(U.(A)) =: n < 00 
for some € > 0. In the case n = 0 we have X ¢ o(A) ; otherwise it is an isolated 
point of the spectrum with a multiplicity not exceeding n. & 


The complement a4(A) := 0(A) \ @ess(A) is called the discrete spectrum of 
the operator A ; it consists of all isolated eigenvalues of a finite multiplicity. If the 
essential spectrum is empty, o(A) = op(A), we say that operator A has a purely 
discrete spectrum. Any such operator has a pure point spectrum but the converse 
is not true (Problem 24). The fact that oess5(A) = @ can be expressed equivalently 
by means of the operators (A—A)~! (see Problem 25b); this is why instead of a 
purely discrete spectrum we often speak about operators with a compact resolvent. 

Theorem 4 easily implies that the essential spectrum does not change if we add 
a compact Hermitean operator to A (Problem 26). It appears that this stability 
result remains valid for a much wider class of perturbations. An operator TJ, in 
general unbounded, is said to be relatively compact with respect to a self-adjoint 
A, or briefly A-compact, if Dr > Da and the operator T(A—i)~! is compact; 
in such a case the first resolvent formula together with Theorem 3.5.2 shows that 
TRa(A) is compact for any 4 € p(A). 


5.4.5 Proposition: Suppose that A is self-adjoint and T is A-compact; then 
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(a) a sequence {rn} C D(A) which satisfies ||zn||?+ ||Avnl]? < c for a positive 
c and all n = 1,2,... contains a subsequence {xp,} such that {Txp,} is 
convergent, 


(b) if A+T is self-adjoint and the operator T is (A+T)—bounded, then it is also 
(A+T)-compact, 


(c) if the operator T is symmetric and A-compact, then A+T is self-adjoint and 
T is (A+T)-compact. 


Proof: We set. Yn := (A—1)am. The parallelogram identity gives ||yn||? < 2\lrn||?+ 
2\|Azp||?, so {yn} is bounded and by Theorems 1.5.14 and 3.5.2 there is a convergent 
subsequence {T'Ra(i)¥n,} = {TXn,}. To prove (b), we take a bounded sequence 
{yn}. The vectors zp := Rair(i)yn belong to D(A+T) = D(A) and there are real 
a, 3 such that ||Tz,||? < a7\|(A+T)2nl|\?+ G?||zn{|? holds for all n. Furthermore, 
|(A+T) Zl] = |lyntiznl|? < 2llyn||?+ 2||Ra+r(2)|I?llynll? ; these two inequalities show 
that the sequence {||zn||?+ ||Azn||?} is bounded, so TRa+r(i) is compact by (a). 

Next we employ the inequality ||Tx|| < an(\|Az||+n||x|]) with ap := ||TRa(in)||, 
which is valid for any x € D(A) and n=1,2,...; we shall show that a, — 0. We 
define B, := I — i(n—1)R,(—in) = (A+i)R,4(—in) ; then the functional—calculus 
rules in combination with Problems 8 and 23a yield 

+1 
Baul? =f lt+iPdpayimylt) = fa duylt) 

for any y € H, and therefore ||B,y|| — 0 by the dominated-convergence theorem, 
i.€., SlimpoB, = 0. The compactness of TRa(i) implies that its adjoint is also 
compact; then |/TR4(i)B*|| = ||Bn(TRa(i))*|| ~ 0 by Problem 3.32, and since 
Br = 14+ t(n—-1)Ra(in) = (A-i)Ra(in), we get a, = ||TR4(i)B*|| — 0. Hence 
T is A-bounded with zero relative bound; in particular, it is self-adjoint by the 
Kato-Rellich theorem. Choosing n large enough, we have a, < - and therefore 
[Tx|| < ||Axl| - |[Tx|| + nllz|| < ||(A+T)z + nllz|| for any x € D(A), 2.€., T is 
(A+T)-bounded; to conclude the proof, we have to apply assertion (b). 


5.4.6 Theorem: The essential spectrum of a self-adjoint A is stable with respect 
to a symmetric A-compact perturbation T, 7.e., oess(A+T) = Gess(A) . 

Proof: It is sufficient to check that Tx, — 0 for any sequence {z,} which sat- 
isfies condition (b) of Theorem 4; in view of the compactness of TRa(i), this is 
equivalent to w-limp...(A—1)tn = 0. Since w-limy..%n = 0 by assumption, we 
have ((A+%)y, tn) = (y,(A—1)an) 30 for any y € D4. Let » € Oes5(A) ; then 
the sequence {(A—2)z,} is bounded because limsup ,_,..||(A—i)xnl| < |A—i}, and 
therefore w-limp—oo(A—1)%n = 0 due to Theorem 1.5.12c. In this way, we get 
Oess(A) C Oess(A+T), and since Proposition 5c allows us to switch the roles of A 
and A+T', the two sets are equal. J 


Spectral decomposition provides another way to decompose the spectrum of a 
self-adjoint operator A which is based on the relations of the measures pz := 
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(x, E4(-)x) to the Lebesgue measure on JR. Let Bo be the family of all Borel sets 
of zero Lebesgue measure; then we introduce 


/) 
ty 


{zr EH: pr(N) =0 forall N € Bo}, 
{zEH: thereis N, € By such that p.(IR\ Nz) =0}; 


using the terminology of Appendix A.3, we may say that Ho. and H, consists 
just of the vectors for which the measure pt, is respectively absolutely continuous 
and singular with respect to the Lebesgue measure. Alternatively, we can use the 
fumetionsg,: o;(t) = (x, Ez). In the case of © € Hac we have pz(J) = 
oz(b) — oz(a) for any interval J with the endpoints a < b, and therefore Ha, 
consists of the vectors x € H for which the function o, is absolutely continuous. 


5.4.7 Proposition: The subspaces H,. and H, are closed and mutually ortho- 
gonal, H = Ha. @ H,;, and moreover, they reduce the operator A. 

Proof: Clearly x € Hae iff Ea(N)x = 0 for all N € Bo, and yEH, iff 
Ea(Ny)y =y for some N, € By. Then (x,y) = (a, Ha(Ny)y) = (Ea(Ny)2,y) =0, 
so Hae C H+. To check that this is in fact an identity, we take any N € By and 
ze€ Hi. Since E,(N)x € H,, we get Hac = Hi. It remains for us to prove 
that H, is a closed subspace reducing the operator A. Let a sequence {yn} C H, 
converge to some y ; then there are sets N, € Bo such that Ei4(Nn)yn = Yn. Their 
union N := U%, Nn also belongs to By, so E,4(N)yn = Ea(Nn)¥n = Yn and the 
limit gives E4(N)y = y. Hence the set H, is closed; in a similar way we check that 
it is a closed subspace. It reduces A iff EM x € H, holds for any x € H, and 
t € RR; this is true because F4(N,)x = x, and the operators E',4(N,) and E“ 
commute. ff 


Due to Problem 4.24b, the operators Age := A} Hac and A, := A}H,, which 
we call respectively the absolutely continuous and singular part of the operator A, 
are self-adjoint. Their spectra are respectively called the absolutely continuous and 
singular spectrum of A, and we have 


A = Aac®As, (A) = Oae(A) Uo,(A). (5.10) 


The operator A also has other orthogonal-sum decompositions; we know from 
Example 4.4.3 that A = Ap@® Ac, where A:= A} Hp and Hp = DXeu,¢a) N(A). It 
is not difficult to check the inclusion H, C H, (Problem 27); if H, is a proper sub- 
space we denote its orthogonal complement in H, by Hg-, and set Asc := A) Hee 
and dsc(A) := o(Asc). By definition, A,, has no eigenvalues. This means, in 
particular, that #H,, is either infinite-dimensional or trivial, and furthermore, that 
the spectral decomposition of As- is continuous. The operator A, is called the 
singularly continuous part of A and o.-(A) is its singularly continuous spectrum. 
Problem 4.24b now gives As = Ap @ Asc and o,(A) = o,(A) Ua,-(A) ; combining 
these relations with (5.10) we get 


A = Ap@ Ac @ Ase, (A) = Op(A) U Gac(A) U osc(A) . (5.11) 
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5.4.8 Remark: The operator A, := A)H> has no eigenvalues, i.e., a purely 
discrete spectrum; this is why the continuous spectrum of a self-adjoint operator A 
is often defined as the spectrum of A,. This definition differs from that of Section 1.7 
because the sets op(A) and o(A,) are in general not disjoint (Problem 24c). An 
isolated eigenvalue which is not an isolated point of o(A) belongs to o(A-) ; such 
eigenvalues are said to be embedded in the continuous spectrum. 


5.4.9 Examples: (a) Consider the space L*(JR, du) with an arbitrary Borel mea- 
sure p and the operator Q, : (Quy)(x) = xy(z). Let = pac + Us be the 
Lebesgue decomposition of with respect to the Lebesgue measure on FR ; 
then there is a set N, € Bo such that w,(R\ N,) = 0 and pac(N,) = 0. 
Denote H; := L?(UR, du;) ; it is easy to check that 


(U lees bs])(2) = aes . acai 


is a unitary operator from Hae ®@H, to L?(IR,dy) and a vector ~ € H, 

if w=xn,¥- Since (Eg(M)p)(x) = xmu(z)v(x) by Example 5.3.11b, we 
conclude that the absolutely continuous and singular subspaces are in this 
case isomorphic to L*(JR, doc) and L?(JR,du,), respectively. In particular, 
the spectrum of the operator Q on L?(JR) is purely absolutely continuous, 


Wte(Q) = LR) and o(Q) = Fac(Q) =R. 


(b) Let us ask under which condition the operator Ty € Lea(L?(JR")) corresponding 
to areal Borel f has a purely absolutely continuous spectrum, 7.e., Hac(Ty) = 
L?(UR") and oa-(Ty) = 0 (Ty). Due to Examples 5.2.8 and 5.3.11b, the spectral 
measure of Ty is Ey := E(f‘-”(-)), so 


a 2 
LENO? = fi yy W@)Pae 
holds for any w € L*(R") and N € B. It is sufficient therefore to require 
that f'-")(N) c IR” has Lebesgue measure zero for all N € B of Lebesgue 
measure zero — see, e.g., Problem 28. 


5.5 Functions of self—adjoint operators 


The spectral theorem together with functional—calculus rules gives us a tool to asso- 
ciate a class of operators with a given self-adjoint A. As usual, E, is the spectral 
measure of A and ®'4) := ®,, will denote the set of all E,4-a.e. defined Borel 
functions y : R — €. In particular, any function continuous on the spectrum 
belongs to this set, C(a(A)) c &). 

The operator T‘4)(y) = fp y(t) dEa(t) is for any y € ®'4) fully determined 
by the function y and the operator A. It is customary to denote these operators as 
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(A) and call them functions of A. This notation is less cumbersome, particularly 
when we deal with elementary functions; one should keep in mind, of course, that 
y rather than A is the “variable” in such a “function”. 


5.5.1 Examples: (a) The multiplication operator Q, on L?(JR,du) with a Borel 
measure y can be written as Q,, = Tg. Moreover, Example 5.2.8 implies 


T,= | f()aB(t) 


for any Borel function f : JR — € where the projection—valued measure is 
determined by E™)(M) := Ty,,, M € B; in the particular case f = id 
the above formula represents the spectral decomposition of the operator Q,.. 
Given a function y € ®‘%) we can extend it arbitrarily to a function defined 
on the whole JR denoted by the same symbol; since the extension concerns an 
E™)-zero set, Problem 6c implies y(Q,,) = Tp. 


(b) functions of the operator P on L?(JR): By Example 4.4.7, P is unitarily 
equivalent to the operator Q ; in view of Problem 12 and Example 5.3.11c, 
this implies 6?) = 6%) and y(P) = F-'y(Q)F for any y € ®). Since 
the Fourier—Plancherel operator has a simple functional realization, we are in 
some cases able to express the operator y(P) explicitly. For instance, e” is 
for any a € JR a unitary “substitution” operator which acts at an arbitrary 
y € L?UR) as 

(cP) (x) = o(a+a). 


The translation operator U,: Uaw = w(-+4a) is unitary by Example 3.3.2, so 
it is sufficient to check the identity e” = U,w for all ~ of some dense set 
in L?(JR). Using Example 1.5.6 we easily find that e@?Fy = FU,y holds 
for y € L?UR) 9 L'(UJR) ; hence the result follows from the above unitary 
equivalence. 


As another illustration, consider a function y € L*(JR) ; then y(P) has the 
following integral—operator representation, 


((P Ya) = (2m)? f (Fe)(y—2)W() dy. (5.12) 


To check this formula, notice that the product of » and 7 := Fwy be- 
longs to L'(JR), so the relation (3.2) yields y(P)~(x) = (F-4yn)(x) = 
(2r)-1/2(9,, Fw) for a.a. x € IR, where 0,(y) := e~%p(y). Let Th de- 
note the operator of multiplication by x(-nn); then T,,0. € L? ML’ and since 
F is unitary, we have (6,, Fw) = limp.o(F~!T,62, ~~). Next we have to write 
the expression on the right side explicitly, and to make a simple substitution in 
the appropriate integral; using then the identity Uz = e*" proven above, 
we get 
Vn (o(PY#)(e) = Jim (F-"Tnbo, Py) = fF (o)Hla+y) dy. 


M—+ 090 
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However, (F-!9)(y) = (Fy)(y) , so the relation (5.12) is valid for an arbitrary 
w € D(p(P)) = F-'D(y(Q)) andaa. re R. 


The spectral measure of a self-adjoint A is itself a function of A and as such 
it can be expressed by means of other functions of the same operator. 


5.5.2 Example (Stone formula): Given € > 0, a finite interval J := [a,b] with 
b> a, anda number u € J, we define the function ~™ : R > IR* by y(t) := 
Im (t—u—ie)~! = e[(t—u)?+ €?]-!,, and furthermore, 


yet) = ir w(t) du = arctan (=) — arctan (=) ; 


by Theorem 5.5.2d we then have s-limeiopye(A) = § [Ea[a, 6] + Ea(a,6)]. To 
express the left side of the last relation, notice that the function (t,u) > y(t) 
fulfils the assumptions (i)—-(iii) of Proposition 5.2.5. At the same time, we have 
yo = £(Ra(utie)—Ra(u—ie)) , so the first resolvent formula in combination with 
the inequality ||R4(u+ie)|| <7! gives the estimate || (AW (A)]|| < e7?|u-o] , 
which shows that assumption (iv) is valid too. Hence the proposition may be applied: 
we get pl) = 2 fe [Ra(utie)—R,4(u-ie)] du, which together with the above limiting 
relation provides an expression of the spectral measure in terms of the resolvent, 


1 b 
Eala, 6] + Ea(a,b) = = slim [ [Ra(utie) — Ra(u—ie)] du, (5.13) 


which is known as the Stone formula. 


Since the resolvent set of a self-adjoint A is FE4-zero, the operator y(A) is 
by Theorem 5.2.9d fully determined by the restriction y)o(A). If this function is 
continuous, we can strengthen the assertion of Theorem 5.2.11la. 


5.5.3 Proposition: Let y € C(o(A)), then o(y(A)) = y(o(A)) ; in particular, 
o(p(A)) = y(o(A)) if A is bounded. 

Proof: If y= p(x) for some z € o(A), then due to the assumed continuity we can 
find for any 6 > 0 a neighborhood U(y) such that (rx—6,2+6) C yi (U(y)). 
Since x € o(A), we have E4(x—6,2+6) #0, and therefore E,(y'-»(U(y))) £0, 
i.e., y € RE) (y) = o(y(A)). This yields the inclusion y(o(A)) C o(y(A)) , which 
implies y(o(A)) C a(y(A)) because the spectrum is closed. On the other hand, 
any point z ¢ y(o(A)) has a neighborhood U(z) disjoint with y(o(A)) ; hence 
y-)(U(z)) No(A) = @ holds for the pull-backs, which means that z ¢ o(y(A)). 
This proves the first assertion; the rest follows from the fact that the spectrum of a 
Hermitean A is compact (cf. Proposition 1.3.1c and Theorem 1.3.3b). 


The functions of a self-adjoint A obey the usual composition rules. If w € (4) 
is a real-valued function, then the spectral measure of w(A) is Ey := E(w‘-(-)) 
by Example 5.3.11b, and Proposition 5.2.12 gives (pow)(A) = y(w(A)) for any 
Borel function y: R —@ which is E,,-a.e. defined. 
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5.5.4 Example: The function |id| : x +> |z| is Borel and positive, so we can 
associate with any A € £,,(H) the positive operator |A| := |id|(A) with the 
domain D(|A|) = D(A) ; in particular, |A| = A if A is positive. The composition 
tule yields the identity |A| = (A?)!/?,, which shows that for Hermitean operators 
the present definition of |A| coincides with that of Section 3.2. 


Next, we want to show that the set of all functions of a given self-adjoint A 
can be characterized solely by its commutativity properties. Given a set S of linear 
operators, not necessarily bounded, on a Hilbert space H we define its commutant 
as S':= {Be B(H): BT CTB forall Te S}, and its extended bicommutant as 


Se {)Telin): BIreTR ioral Bes}: 


it is clear how these definitions modify the general algebraic notions of commutant 
and bicommutant — cf. Appendix B.1. The assertion of Theorem 5.3.1b can now be 
written as {A}’ = {E®: te IR}! = {E4(M): M © B}’, where the last identity 
follows from Problem 3. In combination with Proposition 5.2.11b, these relations 


yield the inclusion 
{y(4): pe@} c {A}; 


we are going to show that in the case of a separable H it turns into identity. We 
need an auxiliary assertion. 


. 5.5.5 Lemma: Let A be a self-adjoint operator on a separable H and denote by 
G(x) the closed linear hull of the set {E4(J)x: J € J}, where J is the family 
of all bounded intervals in R. Then there is an orthonormal set {r,}*_,, N < 00, 
such that H = @*_, G(zn). 

Proof: By the functional—calculus rules, a vector y belongs to G(x) iff there is a 
sequence {s,} of step functions such that y = s-limp—oo$n(A). Since the product 
of a step function with x, for each J € J is again a step function, the last 
relation implies E4(J)-invariance of the subspace G(x) for any x € H. Denoting 
the corresponding projection as P(x) we get P(x)E,4(J) = E,(J)P(x) for an 
arbitrary J € J, and furthermore, P(x)x = x, where we have used the relation 
x= lim, E,(—n,n)z. 

Since H is separable there is a countable set M := {y;} C H dense in H ; 
without loss of generality we may assume that all its elements are nonzero. We 
sete 2) i= gel so P(21)y, = yw. If P(xi)y; = y; holds for all 7 > 2 the 
condition M = H implies P(x,) = J and the assertion is valid with N =1. In 
the opposite case we set jo = min{j: P(xi)y; # y;} and define x2 := 2}/||x}|| 
where x}, := (I— P(2))y;, ; we shall show that the projections P(x1) and P(z2) 
are orthogonal. The above derived properties of the projections P(x) show that 


P(a1)P(a2)z = lim P(21)sn(A)z = lim, sn(A)P(a1)(I—P(21))¥, = 0 


atte ran & 


for any z € H, i.e., P(x1)P(x2) =0. Moreover, using the relations P(x2)x, = x4 
and y;, = %,+P(x1)y;, we can check that (P(x1)+P(x2))y; = y; holds for 7 = 72, 


180 CHAPTER 5. SPECTRAL THEORY 


and thus for any 1 < j < je. In the next step we choose j3 as the smallest 
j such that (P(x,)+P(x2))y; # y;, etc.; by induction we prove that after the 
n-th step we have an orthonormal set {x1,...,2,} such that the corresponding 
projections P(z,),...,P(¢n) form an orthogonal family and }y-, P(re)y; = yj 
holds for 1 < 7 < jn. The process is either terminated after a finite number of 
steps, so Py; = y; for all 7 where P := +, P(zn) , or the last relation is valid with 
P :=$-limp oo De, P(xp). In both cases P is the projection onto @, G(zp) , and 
the condition M =H implies P = /, concluding the proof. § 


5.5.6 Theorem: Let A be a self-adjoint operator on a separable 1 ; then the 
condition T € {A}, is valid iff T = y(A) for some Borel function y: IR— C. 


Proof: We have to check the necessary condition. First we shall find for any y € Dr 
a Borel function yy: IR + € such that y € D(py(A)) and y,(A)jy = Ty. By 
the proof of the previous lemma, P(x) € fEA }’ = {A}’ for any zx € H, and since 
T € {A}%, by assumption, we get the inclusion P(x)T C TP(zx). This in turn 
implies P(y)Ty = TP(y)y = Ty, t.e., Ty € G(y) for any y € Dr; hence there is 
a sequence {s,} of step functions such that Ty = s-limnoSn(A)y. This means, 
in particular, that the sequence {s,(A)y} C H is Cauchy, and by the identity 
(5.6), {sn} is Cauchy in L?(R,duy). The measure py = (y, Ea(-)y) is Borel, 
so the sequence converges to a Borel function y, € L?(IR,duy) ; it follows that 
y € D(y,(A)) and y,(A) = Ty. 

In the next step we shall construct a vector y € Dr such that TBy = y(A)By 
holds for py := yg and all B € {A}’ ; we shall use the orthonormal set {z,} of 
Lemma 5. Since T is reduced by P(x), so P(x)Dr C Dr, and there is a dense set 
{y;} C Dr (cf. Problem 4.1), the construction used in the previous proof shows 
that we may assume {tn} C Dr. We set ¥ := limmoo Dre; Cndn, Where {Cn} Cc @ 
is a sequence of nonzero complex numbers such that 7°, |cnl?||Txnll? < oo ; we can 
choose, €.9., Cn ?= (27(1+||Tan||2))7 * These requirements in combination with 
the relation Tz, € G(x,) and the orthogonality of the subspaces ensure existence 
of the limit limm oo 7L1 CnT Zn ; the closedness of T then implies y € Dr. Due 
to the previous part of the proof, y € D(p(A)) and y(A)g = TY, and since the 
operators T and (A) belong to {A}?%,, we have By € Dr NM D(y(A)) for any 
Be {A} and TBg = BTy = By(A)y = y(A) BY. 

To prove that y is the sought function we introduce the projections E), := 
E,(M,), | = 1,2,..., where M := {t € R: |p(t)| < 1}. These belong to 
{A}’ for any |, the operators y(A)E, are bounded (see Proposition 5.2.6), and 
s-limoo/ = I. Moreover, the commutant {A}’ contains all the projections P(r) 
as well as any step function s(A), so we may use the above derived relation with 
B = E\s(A)P(an). We know that P(2n)¥ = Cn%n, and since the coefficients c, 
have been chosen nonzero, we get TE)s(A)tp = y(A)E,s(A)an for all l,n = 1,2,... 
and any step function s. 

Consider an arbitrary vector z € H. By Lemma 5, it can be expressed as 
2 = liMmioo Lhe) 2n With zn € G(rn), and for any m,n = 1,2,... there is a step 
function Spm such that ||zn—Snm(A)£n|| < 27"m7!. This implies z = limm—oo z™ , 
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where 2°") := "1 Snm(A)tn € Dr. The operator y(A)E; is bounded, so 
P(A)JEiz = limmerco DE, (A) EiSam(A)an = limmseo TE:z™ ; since T is closed, 
we have fiz € Dr and y(A)Eyz = TE,z for 1 =1,2,.... The rest of the argument 
is simple: if z € Dr, then TE,z = E,\Tz — Tz and the closedness of y(A) yields 
z= limi. Kz € D(y(A)) and y(A)z = Tz, i.e., T C y(A) ; interchanging the 
roles of T and y(A), we get the opposite inclusion. 


In the remaining part of this section we are going to show how the above re- 
sults generalize to “functions” of a finite set of commuting self-adjoint operators 
Aj,...,An. For the sake of simplicity we shall consider the case N = 2 ; the 
extension to other finite sets is straightforward. 

. We shall therefore consider a pair of commuting self-adjoint operators A, A’ 
with the spectral measures E := E,4 and F := Ey, respectively; recall that the 
commutativity is by definition equivalent to E(M)F(N) = F(N)E(M) for any 
M,N é€ B. By Proposition 5.1.9, the pair E,F determines a unique product 
measure P. We denote the set of all complex Borel functions on JR? which are 
P-a.e. defined as @p ; then the operators y(A, A’) :-= T”)(y) with yp € Op 
will again be called functions of the commuting self-adjoint operators A, A’. Using 
properties of the Cartesian product together with Proposition 5.4.1c we easily find 
that the set IR? \ (o(A)xo(A’)) is P-zero, so ®p contains all Borel functions 
defined on o{A)xo(A’); in particular, we have C(a(A)xoa(A’)) C ®p. 

In some cases, y(A, A’) can be expressed in terms of functions of the operators 
A and A’. As an illustration, we shall extend the conclusions of Example 5.2.4 to 
unbounded functions. Consider y € ®g and a Borel function y; on JR? which 
fulfils F-a.e. the relation y, = yxe, 2.e., yi (x,y) = v(x), and therefore belongs 
to ®p. If s is a o-simple function then s, := sxe is again o-simple, and since 
p)(M) = pP)(MxIR) for all zx € H and M € B(H), wesee that s € LPR, du) 
implies s; € L?(IR?,du\P)) and the corresponding integrals equal each other; we 
shall use this fact to prove the identity yi(A, A’) = (A). 

Without loss of generality we may assume that the functions y and y are real— 
valued; then the operators y,(A, A’) and y(A) are self-adjoint and it is sufficient, 
e.g., to check the inclusion y(A) C yi(A,A’). Let x € D(p(A)) so there is a 
sequence of o-simple functions s, € L?(IR,du\)) such that ||y*—splloo +0. The 
sequence {y?—s,xe} C LUR?, du?) then converges to zero uniformly in IR?; hence 
y? € LUR?, du), i.e, 2 € D(yi(A, A’)). In a similar way, we check the identity 
(x, y(A)z) = (x, ,(A, A’)x), which yields the sought inclusion by the standard 
polarization and density arguments. The same reasoning applies to functions which 
depend on the second argument only; see also Problem 33b. 


5.5.7 Proposition: With the above notation, the relations 
(pxe)(A, A’) = g(A), (exy)(A, 4’) = ¥(4), 
((pxe) + (exp))(A, 4’) D p(A)+ (4), 
(py)(A, 4’) > 9(A)y(4’) 


are valid for any y € ®g and w € Ff ; the necessary and sufficient conditions 
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under which the inclusions turn into identities are given by Theorem 5.2.9b,c. 


5.5.8 Example: Consider the space L?(R?,d(u1 ®@ 2)), where p41, We are Borel 
measures on JR, and the operators Q, of the multiplication by the r-th coordinate 
defined as Q, := Tia, with id,(x1,22) := x,. The product measure p41 @ pe Is 
again Borel; then due to Example 5.2.8 Eg : Eg(M) = Ty, is a projection— 
valued measure and Q, = T‘¥®)(id,). The operators Q, are self-adjoint because 
the functions id, are real-valued, and their spectral measures are given by Eg, = 
Eg(idS-)(-)) ,so Eg,(M) = Ee(MxIR) and Eg,(N) = Eg(IRxN). Multiplying the 
last two expressions we get the relation P(MxN) = Eg(MxN) forany M,N EB, 
so the two projection-valued measures coincide, P = Eg. Hence the operator 
v(Q1,Q2) := T'?)(y) is for any y € ®p nothing else than the multiplication by 
this function, 7.e., (Qi, Q2) = T, 

Notice that in view of Example 2.4.5, the operators Q, can be expressed alter- 
natively in the tensor-product form, Q; = Q,, ® Iz and Q2 = I; @ Q,,, ; we have 
to use the fact that the operators whose closures are taken on the right side are 
e.s.a. (see Theorem 5.7.2 below). 


5.5.9 Example: Let p;(t) := t? ; then Example 5.2.10 gives (p; x px)(A, A’) C 
p;(A)p,(A’) = A?(A’)*. Interchanging the operators A, A’ we see that A/(A’)*r = 
(A')FA%x holds for all « € D(A%(A’)*) MN D((A‘)*A?) and any 7,.k = 1,2,.... In 
particular, commuting self-adjoint operators obey the relation 


(AA'— A'A)n = 0, 2 € D(AA')ND(A‘A). 


Let us stress, however, that in distinction to the bounded-operator case this condi- 
tion is necessary but in general not sufficient for commutativity of A and A’ (cf. 
Example 8.2.1). 


Some other properties of commuting self-adjoint operators are collected in Prob- 
lem 33. In conclusion, let us mention how Theorem 6 generalizes to the present 
situation. Using Problem 33f we get the inclusion 


{9(A,4) : pe Dp} Cc {AA} | 


it turns into identity if the underlying Hilbert space is separable. 


5.5.10 Theorem: Let A, A’ be commuting self-adjoint operators on a separable 
H; then T € {A, A’}%, iff T = (A, A’) fora Borel y: R? —C. 


Proof: Mimicking the proof of Lemma 5 we get the > sean H = @*_, G(r), where 
{2m}, is an orthonormal set and G(ap) := { P(J)a_n: J € J}. In view of the 
above inclusion, we have to find to a given T € L AH), which satisfies BT c TB 
for all B € {P(-)}’, a Borel function y : R? > € such that (A, A')=T ; te this 
end the proof of Theorem 6 can be used with a few evident modifications. J 
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5.6 Analytic vectors 


For any B € B(H) we can define e? as the appropriate power series convergent 
in the operator—norm limit — see Problem 1.66. This Sera makes no sense if 
T is unbounded; however, it is possible that the series  %, 7; T*x converges for 
semen e C™(T) =e D(T*). Notice that C™(T) is dense for a self-adjoint T ; 
otherwise it can consist of the zero vector only, even if T is symmetric (Problem 35). 
A-vector xz € C™(T ) is called an analytic vector of the operator T if the power 
series 7°29 ||T"z|| = has a non-zero convergence radius; we denote the latter as 
rr(x) or simply waar It follows from the definition that the analytic vectors of a 
given T form a subspace which contains, in particular, all eigenvectors of T. 


5.6.1 Proposition: Let z be an analytic vector of a self-adjoint operator A ; then 
n oy 
izA tA, 2 (iz) k 
ze D(e ) awe saa nn 


holds for any complex z with |z| < r(x). 
Proof: We denote n,(t) := e*”* ; then Theorems 5.2.2d and 5.2.9¢ yield 


n sy\k 
oe ; tz ae 
[exis dEo = n2(A)Ea(—j,j) = slim 7 ee 2 A Ea Geil) 

k=0 | 

for any positive integer 7. From here we get the estimate 
1/2 z 
(jf lmlPr-indue) < SE yAtEa(-aj)al < 5 EP yAtel < 00; 
=() 


together with Fatou’s lemma, this implies fm |nz|?due < 00, te, x € D(n,(A)). 
é . k 

Next we use the estimate |e’ — 75 a < el ; since x € C*(A) N D(e'#4) 

and fpe?""ldu.(t) < co holds for any r € (0,r(x)), the assertion follows from the 


identity (5.6) and the dominated—-convergence theorem. § 


The concept of an analytic vector provides us with a tool for checking essential 
self-adjointness as the following theorem shows. 


5.6.2 Theorem (Nelson): A symmetric operator whose analytic vectors form a 
total set is essentially self-adjoint. 

Proof: Consider first the case when a symmetric A has equal deficiency indices. 
This ensures the existence of self-adjoint extensions; let S be one of them. If z 
is an analytic vector of A, it is at the same time an analytic vector of S and 
ra(x) = rs(x) =: r(x) ; we claim that the function F : z+ (y,e5x) is for any 
y EH analytic in the strip G:= {z € @: |Imz| < $r(x)}. Setting z =u+iv 
we can write it as F(z) = f(u,v) +2g(u,v) , where 


iene te e~” cos(ut) dvyz(t), g(u,v) := I e” sin(ut) dvyx(t) 


184 CHAPTER 5. SPECTRAL THEORY 


and Vyz := (y,Es(-)x) ; we have to check that the functions f and g have in 
Rx (-$r(z),$r(x)) continuous partial derivatives which fulfil the Cauchy-Riemann 
conditions 0,f = d,g and 0,f = —Oug. This is obviously true for the integrated 
functions, and moreover, their partial derivatives are in IR x [-r,r], where r € 
(0, 4r(x)), majorized by the function h : h(t) = |t|e""!. The Hélder inequality 
gives (f |A(t)|?due(t))” < fttdys(t) fet”dus(t), so the estimate of the previous 
proof in combination with the assumption r € C%(S) shows that h € £L°UR, dur). 
Then h is vy,—-integrable by Proposition 5.2.7 and the analycity of F follows from 
the dominated—convergence theorem; Proposition 1 yields the Taylor expansion of 
the function F' around the origin, 


foe) * \k oo i k 
Fee GS =: = ¥. @ Ate) - 
k=0 : k=0 , 


for |z| < r(x). In view of Corollary 4.3.10, we have to check that Ker (A*+7) = {0}. 
We take w € Ker (A*—i) and set y = w in the Taylor expansion. Since (A*)*w = 
i*w, it gives F(z) = e?(w,x) in the disc {z: |z| < }r(x)}. However, the two 
functions are analytic in G, and therefore the identity holds for all z € G (see the 
notes), in particular, F(t) = e’(w,x) for any t € IR. At the same time, e** is 
unitary, so F’ is bounded on RR ; this is possible only if (w.r2) = 0. The vector 
x has been chosen from the set of analytic vectors which is dense by assumption; 
hence w =0 and Ker (A*-i) = {0}. The relation Ker (A*+7) = {0} can be checked 
in the same way. 

Consider finally the general case where the deficiency indices need not equal 
each other. The operator Ag := A@(—A) on H@H is again symmetric, and by 
Problem 4.56a we have n4(Ag) = n_(Ag) = ny(A) + n_(A) ; in particular, Ag is 
e.s.a. iff the same is true for A. If x,y is a pair of analytic vectors of A, then 
[x,y] is an analytic vector of Ag. This shows that Ag has a dense set of analytic 
vectors in 2#@ 4H and by the first part of the proof itises.a.. § 


There is a useful modification of Theorem 2 based on the notion of a semianalytic 
vector of a given T’. This is what we call any vector z € C™(T) such that the 


: an * * 
power series Sno Gay |T"z|| has a nonzero convergence radius (which we denote 


by o(T)). 


5.6.3 Theorem (Nussbaum): A positive operator whose semianalytic vectors form 
a total set is €.s.a. . 


Proof: Let A be positive; then by Theorem 4.6.11 it has a positive self-adjoint 
extension S > A. If x is a semianalytic vector of A it is at the same time a 
semianalytic vector of S ; mimicking the preceding proof we can check that it 
belongs for any |z| < 0(x) to the domain of the operator cos(zVS). In particu- 
lar, we have {5° cosh(2rvVt) dus(t) < oo for any r € (0, o(x)), and therefore also 
 ervt du. (t) < oo. In the same way as above we check that the function F : 
F(z) = (y, cos(zVS)z) is for any ye H analyticin G:={ze@: |Imz| < $0(x) } : 
the relevant majorizing function which allows us to differentiate under the integral 
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sign is in this case h(t) :-= 4 Vt (e’* + 1). The Taylor expansion now reads 


cos(zV'S)x = lim. Ie =i Ata 


for |z| < o(x). Choosing an arbitrary vector w € Ker a + i), we get from here 


F(z) W,x +i)* 

= (u,z) > (i a 
for |z| < o(x) ; however, the right side is an entire function so the identity extends 
to the whole strip G. In particular, we have |F(u)| = 271/?|(w, x)|[cosh(/2u) + 
cos(/2u)]'/* for all u € JR, and since cos(zVS) is a bounded operator, we finally 
find (w,x) = 0 for any semianalytic vector of A, i.e, Ker(A*+i1)={0}. I 


5.6.4 Example: Consider the operators Az := 27/2(Q + iP)}S(JR) on L?(R) 
which obviously map S(JR) into itself. The operator H) with the domain D(H) := 
S(R) defined for any 4 € R by 


i ae + : + ae = [5(@+ P?) +9! / SUR) (5.14) 


is symmetric; it is positive provided \ > 0. We shall show that the hn, n = 1,2,..., 
defined by (2.1) are semianalytic vectors of the operator H). It follows from (5.14) 
that H* can for any positive integer k be expressed as a sum of 18* terms of the 
form cXA,A2...A4x, where |c,| < max{1,|A|/4} and each A;, 7 = 1,...,4k, is 
equal to one of the operators J, A,, A_. 

Using the well-known functional relation between Hermite polynomials we find 
Agha = V/n+I1has and A_hn = V/nhn-1, where in the case n = 0 the last relation 
reads A_hp = 0. By induction, we get the estimate 


(Ar Anhal < | Veto ea 
j=1 


which shows that the series 772.9 2 oui ~ ||H*hal| converges if |z| < (72 |c,|)~!/?. Thus 
by Nussbaum’s theorem, H) is e.s.a. for any 4 > 0. Notice that the above estimate 
does not imply analyticity of the vectors hn (see also Problem 36). 


5.7 Tensor products 


Given a pair of self-adjoint operators A; and A2, it is natural to ask about the 
relations between their spectral properties and those of their tensor product, or more 
generally, polynomial combinations of tensor products. We shall consider operators 
of the type 


ny n2 


P{A1, Ao] = >>> au( Ak @ Ad) (5.15) 


k=0 [=0 
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with the domain Dp = D(P[Ai, Ag]) := D(AT")x D(AQ’), where 1,n2 are non— 
negative integers and a, are real coefficients. We shall use the notation introduced 
in Section 4.6, A; := A: @J2 and Ay := I; @ Ao. 


5.7.1 Remark: Without loss of generality, we may assume that @km, and Qn 
are nonzero for some k,!. Then Dp is the natural domain for the operator (5.15); 
recall that D(A") > D(A™) holds for n < m. With Remark 4.5.3a in mind, 
we shall also assume that at least one of the operators A,, Az is unbounded; a 
reformulation of the results for the case when both of them are Hermitean is easy. 
Finally, the results extend in a straightforward way to polynomial combinations of 
tensor products A! @---@ Ah for any finite N. 


5.7.2 Theorem: The operator P[A;, Ag] given by (5.15) is esentially self-adjoint, 
and the same is true for its restriction P[{A,, Ag] := P[A1, A2] ! Dix D2, where D, 
is any core of the operator A?", r= 1,2. 


Proof: The operators A” are self-adjoint, so P[A,,A2]| is densely defined and 
symmetric by Theorem 4.5.4 and Proposition 4.1.2. Let E, denote the spectral 
measure of A, ; the set M, := Uy Ran E,(—j,j) is obviously dense in H,. Hence 
M := {x1 @22: z, € M,} is total in 7H, ® H, by Proposition 2.4.4a; to prove 
the first assertion it is sufficient, due to Nelson’s theorem, to show that any element 
of M is an analytic vector of P[Ai, Ao]. Let z, € Ran£,(—j,,j-) ; then the 
functional-calculus rules yield the estimate ||A*x,||? < 72*||z,|/?, which implies 
M, Cc (1%, D(A*®) =: C®(A,), and therefore M Cc C™(P[A;, Ag]). The same 
estimate gives 


| P[A1, Ao} (21 ® 2)|| = | s, Aknin «+> 2». an, Art tz, ® Abn gy, | 
knin kyl; 
< (IP|Qi,52))" lata © wall, 


where we have denoted |P|(ji,j2) := Du laml{j, ; thus the appropriate power 
series converges with the radius r(x; @ x2) = 00. 

The operator P[A, Ag] is again densely defined, and therefore symmetric as a 
restriction of a symmetric operator. Its closure is contained in P[A;, Ag], and since 
we already know that this operator is self-adjoint, it remains to check the inclusion 
P[A;, Ao] C P[Ai, Ae], i.e, to show that to any z € Dp there is a sequence 
{z;} C DixDo fulfilling z; > 2 and P{Aj, Ag|z; > P[A1, Ag]z. Let z = 2; @ 22 
with x, € D(A?*). Since D, is a core for A?" by assumption, there are sequences 
{x7}, C D, such that limj_.o 24 @ 7} = 2 © r and limj.. A™ ai = A™z,. 
It follows that lim;.o(AT ® A})(z1 @ a2) = (A* @ Ab)(x, @ x2) for O< k < my 
and 0 </ < ne (see Problem 37), and therefore also lim; P{Ay, Ao](x3 Q 23) = 
P{Aj, A2](x1 ® x2) ; a linear extension then yields the sought result. § 


Before we start discussing spectral properties of the operator P[Aj, Ag] let us 
mention some simple properties of the operators A, the proof of which is left to the 
reader (cf. Problem 38). 
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5.7.3 Proposition: Let A, € Lsa(H,), r = 1,2, and denote by E,(-) the corre- 
sponding spectral measures; then 


(a) E,(-) is the spectral measure of the self-adjoint operator A, , 
(b) the operators A, and A, commute, 
(c) the spectra of A, and A, coincide. 


By definition, there is a bijective correspondence between the operator P{A,, Ag] 
and the real polynomial P : P(s,t) = Dfho D725 aus*t’. On the other hand, 
since the operators A, and A, commute, there is a unique self-adjoint operator 
P(A,, Ag) associated with the function P : IR? 4 IR. We shall show that 


P(A,, Ay) = P[Ai, Ad]. (5.16) 


Indeed, Proposition 4.5.6 implies A* @ AL c (A,)*(Ag)! C (A;)*(Ae)’, and by 
functional calculus, P(A;, A.) > Nw mane where py(s,t) := s*t!. Fi- 
nally, pw(A,, A.) > (A;)*(A,)! follows from Proposition 5.5.7, so together we have 
P(A,, A>) > P[A;, Ag] ; since the left side operator is self-adjoint and its right side 
counterpart is e.s.a. , we arrive at the stated relation. 


5.7.4 Theorem: The closure of P[A,, Ag] is given by (5.16). The corresponding 
spectral measure is Ep: Ep(M) = yp,,(A,, Ao), M € B, where Py := P&-))(M), 
- and the spectra of the operators in question are related by 


a (PIAi, Aol) = P(o(Ai) x o(Aa)). 


Proof: The remaining assertions follow from Example 5.3.11b and Problem 33c in 
combination with Proposition 3c. & 


5.7.5 Example: Define the operators Aq := A; ® Az and Ap := A, +A). By 
the theorem, their spectra are o(An) = Mp and o(Ay) = Mg, respectively, where 
My := {uu : u€ o(A;), v € o(A2)} and My := {utv : u € o(A)), v € ofA) }. 
Notice that the closure is essential here because the sets My and My need not 
be closed. Consider, for instance, self-adjoint operators with pure point spectra 
o(A,) := ae oe {7a 1,2)... } amd oie {a7 77 =—1,2,...} (eg., the 
operators T,, of Example 4.1.4 for the appropriate sequences s, ). The spectra have 
no accumulation points; however, any integer obviously belongs to o(Ay) \ My. 


Similar relations are valid between the point spectra of the operators under 
consideration — see Problem 39. 
5.8 Spectral representation 


To motivate the problem we are going to discuss here, consider first a self-adjoint 
operator A with a pure point spectrum and suppose that each eigenvalue A; of 
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A is of multiplicity one. The corresponding eigenvectors e; by assumption form 
an orthonormal basis. We define the vector y := 92, j7*e; and set w(M) := 
(y,E4(M)y) for any Borel M, where E(-) is the spectral measure of A — see 
Example 5.3.1la. The set IR \ o,(A) is E4~zero, so any element ~ € L? UR, dp) 
is uniquely determined by the sequence {7(A;)}. Since there is a bijective cor- 
respondence between the vectors e; and the eigenvalues we can define the map 
V:H— L?(R,du) which associates Vx: (Vx)(A;) = 7€ with any r= ¥°; Ge; . 
It is obviously an isomorphism. A vector w € L?(JR,du) belongs to the subspace 
VDa iff Syqlt0(t)|Pdu(t) = Xj |AyOs) 71? < 00 and (VAV~")(Aj) = As@b(As) 
holds for any 7 = 1,2,,...,7¢, (VAV~'W)(x) = zy(z) for p-aa. re R. 

Hence a self-adjoint operator with a pure point nondegenerate spectrum is uni- 
tarily equivalent to the multiplication operator, VAV-! = Q,, on L*(UR, dy) with 
a finite Borel measure p:. This considerably simplifies the analysis of such operators 
because multiplication operators are easy to handle, and the spectral properties of a 
self-adjoint operator are either unitary invariants or they transform in a transparent 
way. It is therefore natural to ask whether the conclusions of the above example 
extend to an arbitrary self-adjoint operator. We introduce the notion of a spectral 
representation (sometimes also called a canonical form) of a given self-adjoint A 
on a Hilbert space H: by definition it is an operator T; of multiplication by a real 
function f on L*(X,dy) which is unitarily equivalent to A ; i.e., there is a unitary 
operator V : H — L?(X,dy) such that VAV~! = Ty. 

It was important in the introductory example that all eigenvalues were assumed 
to be simple; owing to this we were able to construct the isomorphism V using 
the vector y. To introduce an analogy of this vector in the general case, note that 
jPjy = e;, where P; are the appropriate eigenprojections so the set { Pyy: j = 
1,2,...} is total. Given an operator A € Lg(H) we call y € H a generating vector 
of A if the set { Ea(J)y: J € J} is total in H ; here, as usual, 7 means the 
family of bounded intervals in JR. If an operator A has a generating vector, we say 
it has a simple spectrum. 


5.8.1 Proposition: If A € £.a(7#) has a simple spectrum, then H is separable 
and any eigenvalue of A is simple. 


Proof: By assumption, A has a generating vector y. Let J be a subsystem in 
J consisting of intervals with rational endpoints; the set { E4(J)y: J € J,} is 
clearly countable. We denote it as M,(y) while M(y) will be the analogous set 
corresponding to the whole family 7. Using the right continuity of the spectral 
decomposition and Proposition 5.1.3 we find that M(y) c M,(y) ; the separabilty 
of H then follows from Lemma 1.5.2b. Let further \ be an eigenvalue of A and 
denote the corresponding eigenspace Ran E4(A) as N(A). If z € N(A)N {y}d,, 
we have 


(Ea(J)y,z) = (y, Ba(J0 {A})z) = xu(A)(y, 2) = 0 


for any interval J C R,so z€ {E(J)y: J Cc R}+ = {0}. Problem 3.18a then 
implies dim N(\) < 1, and since the opposite inequality is valid for any eigenspace, 
we get dimN(A)=1. § 
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5.8.2 Example: The multiplication operator Q, on L?(IR,du) where p is an 
arbitrary Borel measure has a simple spectrum. To prove this claim we employ the 
fact that {yy: J € J} is total in L?(IR,dy) by Example 1.5.3c in combination 
with the relation E*(M) := Eg,(M)=T,,, — cf. Example 5.2.8. If the measure 
we is finite, then the function ~ : w(x) = 1 can be chosen as the generating vector. 
In the case uUR) = oo the situation is slightly more complicated. Since p is 
supposed to be Borel, there is a disjoint decomposition {J,} C 7 such that R= 
Up Je and p(J,) < oo for all k. The set {yy,} C L?(R,du) is orthogonal and 
the vectors pn := pi, (2"ux)7!/?xy, with pw, := max{p(J,),1} form a Cauchy 
sequence. We shall check that ~ := limp.o%n is a generating vector of Q,. 
Indeed, any J C J has a finite disjoint decomposition J = Oe a: (OF), ao 
xo = Dee, (2x)? B4(Je J). It means that the set {EY (J)y: J € Thun 
contains {y,;: J € J}, and since the latter is total, the assertion is proved. 


An important property of the class of operators with a simple spectrum is that 
we can easily find their spectral representation. 


5.8.3 Theorem: An operator A € L..(H) with a simple spectrum is unitarily 
equivalent to the operator Q, on L?(JR,d), where p is the finite Borel measure 
which corresponds to the spectral measure F’4 and a generating vector y of A by 
w= (y, Ea(-)y). 

Proof: Consider the subspace S C L?(JR,du) consisting of simple Borel functions; 
by Problem 1.47b we have S = L?(JR,du). The operator o(A) is bounded for any 
o € § and the functional—calculus rules imply G:= {xe H: x=a(A)y,o € S} 
is a subspace; since it contains the set {E,(J)y: J € J} we have G = H. 
Now we put Voo(A)y:=o. The map Vy: G > L?(R, dy) is well defined because 
o(A)y = &(A)y implies ||o—o||, = 0, where ||-||, is the norm in L?(JR, du) , as can 
be seen from the relation (5.4). Moreover, we have ||o(A)y||? = |lo||2 = {[Voo(A)yl[2,, 
and since Ran Vo = S,, the continuous extension of Vo is a unitary operator from 
H to L?(R, du) ; we denote it as V. 

We shall show that VAV~! = Q,,. The operator on the left side is self-adjoint 
with the spectral measure F := VE ,(-)V~! — cf. Example 5.3.11c. We have 
yuo € S forany Me B and o € S,s0 F(M)o = Vxm(A)o(A)y = xuc. Since 
the operators F and T,,, are bounded and S is dense in L?(IR,du), the last 
relation implies F(M) = T,,, for all M € B. Using the notation of Example 5.2.8, 
we end up with F = E™ and VAV-! = fptdE“ (t)=Ta=Q,. OW 


If the spectrum of A is not simple the spectral representation does also exist 
but the construction is more complicated; the generating vector has to be replaced 
by a certain generating set. We limit ourselves to formulating the result; references 
containing the proof are given in the notes. 


5.8.4 Theorem: Let A be a self-adjoint operator on a separable H ; then there is 
a space L*(X,du) with a finite measure, a measurable function f : X — JR, and 
a unitary operator W : H — L?(X,du) such that WAW-1 =T,. 
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In the rest of this section, we shall mention an algebraic criterion for an operator 
to have a simple spectrum; in view of Proposition 1 we may restrict our attention 
to the case of a separable #1. We know from Theorem 5.5.6 that the bicommu- 
tant {A}” consists of the operators y(A) with y € L*(IR,dE,). On the other 
hand, any such operator commutes with A, so {A}" Cc {A}’. If A is an operator 
with a pure point spectrum considered in the opening of this section, we can check 
by elementary means that the validity of the opposite inclusion, 7.e., the relation 
{A}"” = {A} is necessary and sufficient for A to have a simple spectrum (Prob- 
lem 42). To extend this result to operators with a nonempty continuous spectrum, 
we shall need the following equivalent definition of the generating vector. 


5.8.5 Proposition: The condition { By: B € {A}”} = H is necessary and suffi- 
cient for y to be a generating vector of an operator A € L4q(H). 

Proof: If y is a generating vector the condition is valid because the operators E4(J) 
belong to {A}”. To show that it is at the same time necessary, consider a vector 
z € {E,4(J)y: J € J}+. The complex measure vzy := (x, Ea(-)y) then fulfils 
Vey(J) = 0 for all J € J, and therefore also vzy(M) = 0 for any M € B 

(Proposition A.4.2). For any B € {A}” there is a function ¢ € L*UR,dE4) such 
that B= y(A); then (2, By) = Jp y(t) dvzy(t) = 0, and since the vectors By by 
assumption form a total set, we obtain r= 0, i7.e., y is a generating vector. {J 


5.8.6 Theorem: Let A be a self-adjoint operator on a separable H ; then the 
following conditions are equivalent: 


(a) A has a simple spectrum, 
(b) there is a vector y € H such that { By: Be {A}"} =H, 


(c) {A}' = {A}”, «e., any bounded operator B which commutes with A equals 
(A) , where ¢ is a bounded Borel function. 


Proof: The equivalence of (a) and (b) follows from the preceding proposition. As- 
sume further that (c) is valid and consider arbitrary z € H and ¢ > 0. By 
Lemma 5.5.5 we can find an integer ne so that ||z — R2, P(an)z\| < ¢, where the 
P(zn) are projections onto the mutually orthogonal subspaces { E4(J)tn: J C J} 
and the vectors x, form an orthonormal set of a cardinality N < oo. Denote 
y:= DM, n-!an,80 Ln = NP(2_)y. By the proof of Lemma 5.5.5, we can find step 
functions 8,,n = 1,...,me, such that ||P(an)z — sn(A)znl| < enz' ; substituting 
for tp, we get ||z—By|| < 2e, where B := %,nsn(A)P(an)y. The above men- 
tioned proof in combination with the assumption gives P(z,) € {A}! = {A}" for 
all n ; hence the relation ||z—By|| < 2e yields (b). 

To prove that (b) implies (c) it is sufficient to check that {A}’ is a commutative 
set — cf. Proposition B.1.2. In view of the decomposition B = ReB+ilmB we 
have to show only that C;C, = C,C; holds for any Hermitean C,,C2 belonging 
to {A}’. By assumption, for any x € H one can find a sequence {Bn} c {A}” 
such that Bpy — x. Since each of the operators C,,C2 commutes with all B,, 
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we see that the sought relation is valid iff C,Coy = C2Ciy, and using assump- 
tion (b) once more we can rewrite it equivalently as (C\Coy, By) = (CoCiy, By) 
for all B € {A}". By Problem 43c, there are sequences of Hermitean B{) € 
{A}",r = 1,2, such that B&)y — C,y. Then (C,Coy, By) = (Coy, BCiy) = 
limnoo(y, BY’ BB y) and the analogous expression is valid for (C2Ciy, By) ; since 
{A}” is a commutative set the proof is finished. 


5.9 Groups of unitary operators 


A family {U(s): s € R} of unitary operators on a given H is called a strongly 
continuous one-parameter unitary group (for brevity, one or both of the first two 
adjectives are often dropped) if the map s ++ U(s) is continuous in the strong 
operator topology and U(t+s) = U(t)U(s) holds for all ¢t,s € IR. It follows from 
the definition that the set {U(s): s € IR} is commutative, and furthermore, the 
unitarity implies U(0) = J and U(—s) = U(s)-! = U(s)*. 

Given a unitary group, we denote D := {x € H: lim,.9 s-!(U(s)—I)z exists } 
and define the operator T with the domain Dr := D by 


it is called the generator of the group { U(s): s € IR}. The group determines the 
domain D uniquely. Moreover, the group condition U(t+s) = U(t)U(s) implies 
U(s)D c D for any s€ JR and 


TU(eja — UiGwe = lim aS = i F(s)e. (5.17) 


Note first that a unitary group can be associated with any self-adjoint operator. 


5.9.1 Proposition: Let A be a self-adjoint operator; then {e*4: s € R} isa 
strongly continuous unitary group and A is its generator. 


Proof: The first assertion follows easily from functional calculus; it remains to check 
that the generator T coincides with A. For any s #0 the function y,: »,(t) = 
(e***—1)/is is continuous and bounded, 2.e., it belongs to L°UR,dE,). Let x € Da; 
then the relation (5.6) gives ||(ys(A) — A)z||? = Jr |ys(t) — t|?due(t). The estimate 
|ys(t) — t| < 2|t| in combination with the dominated-convergence theorem shows 
that lim,.9y,(A)x exists and equals Az ; hence A C T. On the other hand, 
let z € Dr so there is lims49 y.(A)x =: y. Then fm |y.(t)|?du.(t) < (\lyl| + 1)? 
holds for all sufficiently small nonzero s, and since lim, |y,(t)|? = t? , we get from 
Fatou’s lemma the condition id € L?(IR, dur), i.e, rE Da. 


It appears that this example exhausts all possible strongly continuous unitary 
groups on H. This is the content of the following theorem, which has a theoretical 
and practical importance comparable to that of the spectral theorem. 
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5.9.2 Theorem (Stone): For any strongly continuous one-parameter unitary group 
{U(s) : s € IR} there is just one self-adjoint operator A such that U(s) = ein 
holds for all se R. 

Proof: If the operator A with the stated properties exists, it is unique in view of 
Proposition 1. To construct it we shall first apply the definition relation (5.17) to 
the vectors of a particular dense subset in H. We define xy := fm f(t)U(t)x dt for 
any f € C&R) and x € H; the integral exists because the function f(-)U(-)xr 
is continuous and ||f(t)U(t)z|| < |lflloxx«(t)||zl], where K is the compact support 
of f — cf. Appendix A.5. In particular, choosing f := j. with j-(t) :=e7'j(t/e), 
where the support of 7 is contained in [—1,1] and f’, j(t) dt =1, we get 


lz — x3, < [Ol O-Del dt < i 3 I(U@)-Dzl| 


for any x € H ; the strong continuity of U(-)z then implies that the subspace 
Do := {x¢: 2 EH, f € COUR) hin is dense in H. 

Let T be the generator of { U(s): s € IR} ; we shall show that Do C Dr and 
Tz; =ixy forall re H and f € Co°. We have 


Ee) Ly = = f f(t)[U(s+t) —U(@)]2dt = | Ae= @) = FO veep ay 
is is JR R is 
for any s #0. The derivative f’ again belongs to C§° , and it is therefore bounded, 
| f’(t)| < Cy for all t € IR. It follows that the norm of the integrated function on 
the right side is majorized by C'sxx(t)||z||, where AK is the compact support of 
f. Hence Theorem A.5.3 may be applied; performing the limit s — 0 in the last 
relation we get Trp =ifm f'(t)U(t)xdt = ixy for any xz € Do; in other words, we 
have found an explicit expression for the operator Ay := T} Do. 

The subspace Do is obviously Ao-invariant. Moreover, U(s)x; = x¢_, , where 
f_. := f(-— 8), and since C§°(U/R) is invariant with respect to translations, Do 
is also U(s)-invariant for all s € IR and AoU(s)z = U(s)Agz = —i (8) z for 
any z € Do. The operator Ao is densely defined, and using the unitarity of U(s) 
we can readily check the identity (Aors,yg) = (xy, Aoy,) for any z,y € H and 
f,g € C§°UR) which means that Apo is symmetric. Suppose now that there is a 
vector y € D(A}) such that Ajy = iy ; then 


d 
% WU) = (u Zs) = HAW) = GUO) 


holds for all z € Do. This differential equation with the initial condition U(0) = I 
is solved by (y, U(s)z) = (y,z)e°® ; to avoid contradiction with the boundedness of 
the function (y,U(-)z), the vector y must belong to Dg. However, Do is dense, 
so Ker(Aj—i) =0. The relation Ker (Aj+i) = 0 can be checked in the same way, 
t.e., Ag is essentially self-adjoint. 

It remains for us to prove that the operator A:= Ap fulfils V(s) := e4 = U(s) 
for all s € RR ; it is obviously sufficient to check that the function y : y(s) = 
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U(s)z—V(s)z assumes zero value only for any z € Dy. We know that 42(s)z exists 


and equals 7ApU(s)z. The condition z € Dp C Dy, in combination with the identity 
|E4(M)V(s)2\/? = ||E4(M)z\\?, M € B, implies V(s)z € D4 ; on the other hand, 
U(s)Do C Do C Da, so y(s) € Da for all s E JR. Using Proposition 1, we get 
y'(s) = iAU(s)z — iAV(s)z = iAy(s) ; hence ¢ 4 \ly(s) ||? = 2Re (Ay(s), PN) =A. 
Finally, y(0) =0 so y(s) =0 holds for all se IR and z€ Do. 


Stone’s theorem implies an expression for the commutant of a unitary group. 


5.9.3 Proposition: {U(s): s€ R}' = {E,(.)}' ={B: te R} = {A}. 
Proof: An operator B € {E,4(-)}’ belongs to {U(s) : s € R}' by functional 
calculus. Since the second and the third relation follow from Theorem 5.1.6, Prob- 
lem 3c and the spectral theorem, it remains for us to check the inclusion { U(s) : 
ser {Eo . t € IR}’. For any t € R and n > 2, consider the peri- 
odic extension to f) : f(u) = max{0, min{1, $(n?+ 1) Sree — u— =(n?+ 1)|}} 
on [t—n,t +n] ; it is not difficult to check that Witigeies f°) = 400,44) for 
any u € JR. By Fejér’s theorem there are trigonometric polynomials P, such 
thee WO Rillese<in' form — 2)3,... ; thus limpas. P.(u) = xeeea(u) sand 
|Pa(u)| < |f(u)|4n-! < 2,80 P,(A)x + Ex for all x € H by Theorem 5.2.2d. 
This means that the relation [B, EM = 0 follows from [B, P,(A)] = 0, which is 
certainly trueif Be {e*4: se R}. Bf 


Using functional-calculus rules, this gives us a commutativity criterion for self- 
adjoint operators (compare to Problem 23b). 


5.9.4 Corollary: Self-adjoint operators A, A’ commute iff [e*4,e*4’] = 0 holds 
for all s,te AR. 


5.9.5 Examples: (a) Given a Borel function f : IR’ — R and a Borel measure 
o on JR*, consider the operators of multiplication by e*/( on L?(UR%,do) 
for s € IR. By Example 5.2.8, they can be expressed as fre! dE (zx) , 
where FE‘) is the appropriate spectral measure; the same argument as in 
Proposition 1 shows that they form a strongly continuous unitary group. Its 
generator A is equal to the operator T; of multiplication by f. This follows 
from the estimate |4(e**/)— 1)| < |f(x)| for any s #0 and x € R*, which 
allows us to apply the dominated—convergence theorem to the relation (5.17); 
we find that any ~ € D(T;) belongs to D4 and Ay = Tyy, i.e., Ty C A, 
and since the two operators are self-adjoint they equal each other. 


(b) The dilation x + e*z of the real axis defines for any s € JR the unitary 
operator Uz(s) : (Ua(s)W)(x) = e'/?~(e%x) on L?(IR) (see Example 3.3.2); 
we can easily check that the family { Ug(s) : s € JR} fulfils the group con- 
dition. Using the dominated—convergence theorem we can prove the relation 
lims_.o(¢, Ua(s)¢) = ||||? for any ¢ € C§°UR), and since this set is dense, it 
extends to all ¢ € L?(JR) ; moreover, the polarization identity in combination 
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with the group condition give w-lim,.Ua(s) = Ua(t) for any t € JR, so Ua(-) 
is strongly continuous by Problem 44. 


To find the generator Ay of { Ug(s): s € IR}, consider a vector ¢ € Co°(IR); 
its support is contained in a certain compact interval [a,b]. If 0 < |s| < 1 the 
function f.: fs(x) = 4(e%/*b(e*x)—G(x)) + $¢(x) + ix¢'(x) is continuous and 
its support is again contained in a certain compact (s-independent) interval 
Jes , for instance, Jo, := [a/e,be] for a > 0. We have lim,.o f.(x) = 0 for 
any x € R and (e%/*d(e'x) —g(a)) = de*/*O(e8x) + xe*4/?y'(e*x) , where 
5 belongs to the open interval with the endpoints 0 and s. Hence f, is 
for any s € (-1,1) \ {0} majorized by (e¥/2[|dlloo + 2lz1 €/[\6'lloo)XJus 5 the 
dominated-convergence theorem then yields Aad = (QP — 5)¢, where Q, P 
are the operators introduced in Examples 4.1.3 and 4.2.5, respectively. Finally, 
C%°(JR) is obviously Ug(s)-invariant for any s € JR, and it is therefore a core 
for Ag by Problem 46, i.e., 


Aa = (QP - 5) }C#UR) = 5(QP + PQ)NCHUR). 


The task of finding the generator to a given unitary group is usually easier if 
the group can be expressed by means of “simpler” unitary groups; for instance, 
Proposition 3 shows that {U(s) : s € IR} is reduced by a subspace iff the 
same is true for its generator which is then the orthogonal sum of the “compo- 
nent” generators. As another example, consider unitary groups {U,(s): s © R} 
on H,,r = 1,2, with the generators A,. The operators Ug(s) := U;(s) @ U2(s) 
on H, ® H2 are unitary for any s € R and fulfil the group condition; using the 
sequential continuity of multiplication together with Problem 45, we can check that 
{ Ug(s) : s € IR} is a unitary group. Denote its generator as Ag. The relation 
(5.17) yields Ag(x®y) = A;w@y+2@Agy for all x € D(A;) and y € D(Ag), ie., 
A, + Ay C Ag. However, the operator on the left side is e.s.a. by Theorem 5.7.2, 
so we have proven the following result (see also Problem 49). 


5.9.6 Proposition: The group { Ug(s): s € IR} is generated by Ag = A, +A). 


If Hermitean operators A, B on the same Hilbert space commute, then the 
products e*4e'*8 form a unitary group which is generated by A+B. This need 
not be true if the operators are unbounded but the assertion still makes sense in 
the functional-calculus sense (cf. Problem 33e). No such result is valid in the 
noncommutative case; instead, we have the following limiting relation. 


5.9.7 Theorem (Trotter formula): Suppose that A, B are self-adjoint operators 
and C' := A+B is e.s.a.; then the corresponding unitary groups are related by 


eC = slim (etarngeein)" ,teR. 
of Sod 2) 


Proof: We shall consider only the particular case when C is self-adjoint; references 
to the complete proof are given in the notes. In view of Theorem 5.2.2 we have to 
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check the identity carmen on ae n\n (et ve x = 0; since the norm of the 
operator in the square bracket is < 2 for any t € R, it is sufficient to do so for all 
x of some dense set, say, all x € Dc. We denote K(u) := 4(e™4e%? — e¥°) 
u#0 and K(0) :=0; then the estimate 


| [(et*metP"n)"— (e**)"] 2 


— (itA/nitB/n\* itA/n_itB/n___itC/n\ (_itC/n\"—*-! 
BS (cttletitn) (gtaigatin_ tn) (gto) 
‘a a a | 
n 


= ue i(n—k—1)tC 
=e = |x (= ~) et 1)t [ny 
shows that we have to check limy+o supj.j<jzj |K(u) esa = 0 forall x € Dc. Since 
De = Dan Dg the identity 


for 


< {t| sup 
|s|<|t} 


k=o 7% 


ai iu eiuB_ I ec _ T 


KQtice ee 7 
easily implies limy4o K(u)z = iAx + iBx —iCx = 0; hence there is 5(x) > 0 
such that ||K(u)z|| < 1 holds for |u| < 6(z). On the other hand, the above 
identity gives ||K(u)ax]| < 6||x||/6(x) ; together we find that the function || K(-)«|| 
is bounded, ||K(u)az|| < C, for some C, >0 andall we R. 

We equip De with the inner product [z,y] + (x, y)o := (x,y) + (Cz,Cy). 
The operator C’ is self-adjoint and therefore closed, so we get a Hilbert space 
(Problem 4.11a), which will be denoted Hc. If K € B(H) the restriction K } De 
belongs to B(Hc, H) in view of the inequality ||z|| < ||z\lc ; the uniform bounded- 
ness principle then implies the existence of k > 0 such that ||K(u)z|| < kllz|lo for 
all ue R and x € Dg. Let MC De bea completely bounded set in He, so we 
can find for any € > 0 a finite e-lattice N., and the relation lim,9 K(u)x = 0 
yields ||K(u)y|| < e€ for all y € N.. Using the uniform boundedness proved above 
we get limyosupzey ||K(u)z|| = 0 ; hence it is sufficient to check that the set 
M, := {e*°z: |s| < |t| } is completely bounded in He for any x € Dc. We have 
M, C De because Dg is invariant under e° , and moreover it is not difficult to 
verify that s + ex is continuous for any z € Do also as a map from R to 
Hc. The set M, as the image of the compact interval [—|t],|¢|] is Heo-compact 
and thus, a fortiori, completely bounded in Hc. & 


Notes to Chapter 5 


Section 5.1 The right-continuity condition in the definition of spectral decomposition is 
a matter of convention; some authors, for instance, [AG] or [BS], require {E;} to be 
left-continuous. The direct-product construction easily extends to any finite commuting 
family of projection—valued measures on Euclidean spaces; moreover, the result holds also 
under much more general assumptions — see, e.g., [ BS], Sec.5.2. It should be stressed 
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that while Proposition 9 is analogous to Theorem A.3.12 for numerical-valued measures, 
the measure pz := (x, P(-)x) is not the product of the corresponding measures for & and 
F,, as the reader can easily check. 


Section 5.3 Our proof of the spectral theorem using the Cayley transformation follows 
J. von Neumann’s original argument — see [vN 2] or [vN], Sec.II.9. There are alternative 
proofs. For instance, Riesz and Lorch used the fact that any self-adjoint operator A can 
be expressed as an orthogonal sum of Hermitean operators — see [RN], Sec.120. Another 
way is to construct the spectral measure via the resolvent of A —cf. [ We], Sec.7.3. The 
Fejér theorem claims that a continuous periodic function can be approximated uniformly 
by trigonometric polynomials — see, e.g., | Jar 2], Thm.191. Proposition 8 can be 
proved without reference to Hermitean and bounded normal operators. The argument 
is similar to that of propane 6; one extends the map which associates the operator 
T(U) := Dt__nceU* with a trigonometric polynomial T : T(s) = kon Ck ks — of. 
[ AG], Secs.77—78. 


Section 5.4 More information about the stability of the essential spectrum can be found, 
e.g., in [RS 4], Sec.XIII.4 or [ We], Sec.9.2. Many authors, e.g., [RN] or [RS 1], 
prefer the continuous-spectrum definition of Remark 8; the difference between the two 
definitions usually causes no problems. 


Section 5.5 The Stone formula has various modifications. For instance, using the fact 
that the spectral measure is right continuous, we get the expression 


i EMA) = 4 Bim s-lim [or (u+te) — Ra(u—ie)| du 
: Qn 60+ \ e-04+ Jog a 

for EA) := E,4(—oo,t] ; this relation can serve as a basis for a proof of the spectral 
theorem — cf. [ We], Sec.7.3. Theorem 6 was formulated by F. Riesz (who derived 
inspiration from an earlier result of J. von Neumann) for the case when both A and T 
are bounded, and extended by I. Mimura to the unbounded case — see [RN], Sec.129. 
The separability assumption is substantial; the result may not be valid in a nonseparable 
Hilbert space. Notice also that the investigation of the operators ¢(A) can always be 
reduced to the case when A is bounded — cf. Problem 32b. 


Section 5.6 Inthe proof of Theorem 2 we have used the fact that if two analytic functions 
coincide on a disc, they coincide on the whole common domain of analyticity — see, e.9., 
[Mar], Sec.VI.6. In combination with Problem 35c, Nelson’s theorem shows that a closed 
symmetric operator is self-adjoint iff it has a dense set of analytic vectors. Another 
consequence of Theorem 2 concerns symmetric (not necessarily closed) operators whose 
eigenvectors form an orthonormal basis in . Since each eigenvector is an analytic vector, 
any such operator is e€.8.a. . 


Section 5.8 The proof of Theorem 4 can be found, e.g., in [ AG], Sec.86; [ Nai 2 1, 
Sec.18.6; [| DS 2], Sec.XIJI.5. The theorem also extends to the case when H is not 
separable — see [KGv], Sec.V.2. Let us stress that the spectral representation of a given 


A € Lzq(H) is far from unique, because if A has a generating vector it will have plenty 
of them — cf. Problem 40c. 


Section 5.9 An alternative proof of the Stone theorem uses the fact that for any x € H 
the function (x,U(-)x) is of positive type, which means that the matrix with the elements 
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(x, U(s;— sx)x), j,k = 1,...,n, is positive for any natural n and arbitrary s1,...,$n. 
By Bochner’s theorem (see, e.g., | AG], Sec.70) any function f : R—C of positive type 
is the Fourier image of a non-negative Borel measure on R. Hence we can associate with 
a continuous unitary group a map x +> pz such that (2,U(t)r) = fme“due(t). By 
polarization, it corresponds to a complex measure vz,. Finally, one has to show that there 
is a projection—valued measure E such that (x, E(J)y) = vey(J) holds for any interval 
J and all z,y € H; see [ RN], Sec.138 for details. 

By definition, the generator of a unitary group {U(s) : s € IR} is determined by 
the function U(-) in a neighborhood of the point s = 0. Hence if {V(s): s € R} is 
a unitary group such that U(s) = V(s), |s| < 6 for some 6 > 0, then the generators 
of the two groups coincide and U(s) = V(s) by Stone’s theorem holds for any s € R. 
Notice also that if U(s) is a unitary operator, Proposition 5.3.8 implies the existence of a 
Hermitean A(s) such that U(s) = e*4() . This operator is obviously not unique. If U(s) 
belongs to a strongly continuous unitary group, the same relation holds with A(s) := sA, 
where A is the appropriate generator; a Hermitean A(s) can be obtained by “folding” 
the operator A(s) to the interval [0,2z). 

By Problem 3.21, the set U(H) is a group. Any one-parameter unitary group is 
an Abelian subgroup in it; at the same time it is a unitary representation of the one— 
parameter translation group. Stone’s theorem can be reformulated as the assertion about 
existence of a spectral decomposition corresponding to such a unitary representation. 
This result extends to strongly continuous unitary representations of any locally compact 
commutative group; this is the contents of the so-called SNAG theorem (M.H. Stone, 
M.A. Naimark, W. Ambrose and R. Godement) — see, e.g., | RN], Sec.140. We shall 
need only the generalization of Theorem 2 to the group of translations of the Euclidean 
space JR”, which is given in Problem 48. 

Theorem 7 represents a generalization to the classical Lie formula for matrix semi- 
groups. It was originally proved in {Tro 1]; another proof can be found in [Cher 1], see 
also [Cher]. Some results of these section extend to the case of strongly continuous con- 
tractive semigroups, i.e., families { C(t): t > 0} such that C(-) is strongly continuous, 
||C(t)|| < 1 and C(t+s) = C(t)C(s) holds for all t,s > 0. By a formula that differs 
from (5.17) by the imaginary unit only, one can associate with such a semigroup a unique 
accretive operator A which is called its generator . There is no direct analogy to Stone’s 
theorem; the right side of the expression C(t) = e~4t remains formal. On the other hand, 
the resolvent of A has an integral representation analogous to that of Problem 31b, and 
the Trotter formula also extends to operator semigroups. More information about these 
problems can be found in the monograph [Da 2]. 


Problems 
1. Prove Proposition 5.1.1. 


2. Prove: (a) Let {E:}zem be a nondecreasing family of projections on a given H, i.e., 
E; < Ey if t <u, then the one-sided limits Ey := s-limysu- FE: = supyey, Et 
and Ey+so := s-limpu4 Et = inftpy, EH exist at any u © WR, as well as at 
u = +00 in the first and the second case, respectively, and the operators 
Ey+o are projections. 
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(b) Let E be a spectral measure on R, then s-limyu£({t}) =0 for any u€uR 
and analogous relations are valid at u = too. 


Hint: (b) Use the generalized Bessel inequality — see the notes to Sec.2.3. 
3. Let {E;} be a spectral decomposition on 7. Prove 


(a) to any x € H, the relation o;(t) := (x, E:x) defines a nondecreasing right— 
continuous function such that limy4—oo or(t) = 0 and limt4co o2(t) = |lz|l?, 

(b) the map E defined on the set J of all bounded intervals in IR by E(J) = 
Eyez — Ga, By — Ea; Evo — a-0) Ey — Ea-o for J= (a, b), (a, 5), (a, b), (a, 6 ’ 
respectively, is additive, and E(J)E(K) = E(K)E(J) = E(JN K) holds for 
any J,K EJ, 

(c) if a bounded operator commutes with FE; for any t € AR, it also commutes 
with E(J) for all Je J, 

(d) a projection—valued measure E is generated by {F;}, i.e, E(J) = E(J) for 
all JE J iff E(—oo,t] = E; holds for any te R, 

(e) let a projection P reduce the set {£;}, 1.e, PE; = EP for all te R; 
then t++ PE;P is a spectral decomposition on the subspace Ran P, and the 
corresponding spectral measures fulfil Ep(M) = PE(M)P for any Borel A/. 


4. Let V:H-—-H’' be a unitary operator, then 
(a) if E : B¢ — B(H) is a projection-valued measure, then E’(M):= VE(M)V7-! 
defines a measure on JR? whose values are projections in B(H’), 


(b) if {E;} is a spectral decomposition in H, then {Ei}, Ej := VE;V—!, is a 
spectral decomposition in H’ and the corresponding spectral measures fulfil 
E'(M) = VE(M)V—-! for any MeB. 


5. If y: IR? C isa Borel function and RS® :={\ € ©: B(y-)((A-e, \+e))) #0 
for any € > 0} its essential range with respect to a projection—valued measure E 
(compare to Example 4.3.7), then 


(a) REP (yp) is a closed set which is contained in Rany, 
(b) pe L°(R4,dE) iff max REs3(|pl) < 00, and then |[ylloo = max Ris2((yl) 


2) 


Prove: (a) Proposition 5.2.1. 


(b) If the functions yp, mn = 1,2,..., and y from L©(IR¢,dE) satisfy the relation 
lyn —Ylloo +0, then T(y) = u-limpooT (Pn) « 


(c) The operator 7%(y) is Hermitean (zero, unitary) if the values of the function 
yp are respectively real, zero, belong to the unit circle B-a.e. in IR. 


7. Let E be the projection-valued measure on JR generated by a spectral decomposition 
{Ez} and denote Ag := fptdE(t). If a := sup{t € R: E; =0} > —oo and 
6 := inf{t € R: E, = I} < oo, then the operator Ag is Hermitean and the 
numbers a, 3 equal its lower and upper bound, respectively. 
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8. 


The identity fme v(t) dvzy(t) = fan Y(t) dvye(t) is valid for all 2,y € H and 
y € L®(IR?, dE), and moreover, Vz,T,(¢)y(M) = Sry (t) dvzy(t) for any Borel M. 


9. With the notation of Example 5.2.4, prove that the relations 


10. 


Hee 


Hos 


te. 


ae 


15. 


16. 


[ww dP) =f oan) + f v(u) dru), 


[ev art) = fede [ vuary = f ww ary f g(t) dB) 


are valid for any y € L°(R,dE) and » € L®(IR,dF) (see Proposition 5.5.7 for 
the generalization of these results to unbounded functions). 
Prove: (a) The operator T(y) := fia y(t) dE(t) is bounded iff y € L©(UR*, dE). 


(b) Suppose that a projection—valued measure on JR has an unbounded support 
and y € ®f(R) satisfies the condition limz+.o |y(t)| = oo ; then the opera- 
tor T(y) is unbounded. Could the condition be weakened? 


(c) If yp, € ®g(UR®) and there are c,d > 0 such that d-!|z(t)| < |y(t)| < elw(t)| 
holds for E-a.a. t € JR*, then the corresponding integrals with respect to E 
have the same domains, D, = Dy. 


Hint: (a) Show that the set {t € R*: |y(t)| > ||T(w)||} is E-zero. 
Let y,y € ®gUR*) and M € B*. Prove 


(a) wr(ge(M) = fyy \o(t)/Pdie(t) holds for any & € Dy, 
(b) T(y) is reduced by the projection E(M), 

(c) if the functions y,~ are real-valued, then T(y+iw) = T(y) +iT(v). 
Suppose that V: H — H’ is unitary and E is a measure on R? whose values are 


projections in B(H) ; then ®g¢(JR*) = ®g(IR*) , where E’ is the projection—valued 
measure of Problem 4a, and T'”)(y) = VT)(p)V—! for any y € @g(IR®). 


Let E be a projection—valued measure on JR®. If a projection P commutes with 
E(M) for any M € B*, then PE(-)P is a projection—valued measure on R? with 
values in B(PH) and the relation T'”)(y)2 = PT‘) (yp) Pa = T'PEP)(p)x holds 


for any y € @p(IR*) and all xe PD. 


The identities T(p+w) = 7T(y)+T(w) and T(yw) =T(y)T(v)=T(v)T(y) hold 
for arbitary functions 9,7 € @z(R?). 


Suppose that E is a projection—valued measure on JR? and w : IR¢ > IR” is a map 
whose component functions w; : IR? — IR, j =1,...,n, are Borel; then 


(a) the relation Ey(M) := E(w'-)(M)), M € B”, defines a projection—valued 
measure on JR”, 
(b) y € L©(UR", dE.) implies pow € L@(IR4, dE) and ||Plloo = ||vow]leo - 


Prove: (a) Proposition 5.3.3. 
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(b) If a polynomial P fulfils P(t) > 0 for all t¢ Ja, then P(A) > 0. 

(c) If a real-valued function is continuous on J, , it belongs to K. 

(d) If a function y € K satisfies |y(t)| <c¢ for all t € Ja, then ||T4(y)|| <c. 
(e) T4(P} Ja) = P(A) holds any real polynomial P. 

(f) The function ey := X(~co,ujnz, Ja belongs to Ko for any we R. 


Hint: (b) Any root of P in J4 has an even multiplicity. 


Suppose that M is a compact set in a topological space and {f,} is a sequence of 
real functions continuous on M such that {fn(x)} is nondecreasing for all re M. 
If there is a c such that fn,(2) < c holds for any x € M and some n,;, then 
fu(x) <c for each x € M and all n large enough. 

Hint: M is covered by open sets U(x) such that fr,(x) <c for any y € U(z). 


Let A be a Hermitean operator and FE, its spectral measure; then 


(a) if a function f : R — R is such that f}J,4 € Ko, then f belongs to 
L°(IR, dE) and Tj” (f) =Ta(f} Ja), 


(b) for any interval J C JR, there is a sequence {P,} of real polynomials such 
that E4(J) =s-limpooPn(A). 


Let B be a bounded normal operator on a separable H with a pure point spectrum, 
op(B) = {Ax : k = 1,2,...}, and denote by {P,} the corresponding family of 
projections to the eigenspaces; then Fg: F'g(M) = 3°, xas(Ak) Pe (strong operator 
convergence) is the spectral measure of B. 


Prove: (a) Let B be a bounded normal operator and Fg the corresponding spectral 
measure. To any pair of intervals J, K C IR, there is a sequence of Hermitean 
operators of the form Sp(B, B*) := Ng Seo at?) Br=*(.B* ernie, . 
such that Fa(J x K) =s-limpiooSn(B, B*). 


(b) If U is unitary, then to any t € JR there is a sequence of trigonometric 
polynomials, Ta(u) := Xo, cl” et, such that BY = s-limpoo Deu. 

(c) A bounded operator B commutes with a unitary U iff it commutes with 
RLY) := Ey(—oo,t] for any t € (0,27). 


Hint: (a) Use Proposition 5.3.2 for Re B and Im B. (c) Modify the corresponding 
argument from Proposition 5.3.6. 


Let U bea unitary operator and Ey the corresponding spectral measure; then a 
complex number e**, 0 < a < 27, belongs to op(U) iff Ev({a}) £0, and in that 
case the corresponding eigenspace is Ny(e) = Ran Ey({a}). 


Lets Pa(iya= yin Mp a\”) Fk with Qi" EC be quasipolynomial functions of the 
Fourier-Plancherel operator Fy on LR?) If the sequence {P,(Fu)} is strongly 
convergent, then there are complex c; such that s-limn—ooPa(Fa) = Ses oF). 
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23. 


25. 


26. 
Bis 


28. 


29. 


30. 


31. 


Prove: (a) The resolvent of a self-adjoint operator A can for any yu € p(A) be 
expressed as Ra(u) = fp 4; dEa(t). 

(b) Self-adjoint operators A, Ao, in general unbounded, commute iff their resol- 
vents commute, R4,(A)Ra, (4) = Ra,(u)Ra,(A) for any Awec\R. 

Find examples of self-adjoint operators with 


(a) a pure point spectrum which is not purely discrete, 
(b) a pure point spectrum such that the discrete spectrum is empty, 
(c) eigenvalues embedded in the continuous spectrum. 


Prove: (a) If A is a Hermitean operator on H, dimH = oo, then cess(A) #0. 

(b) A self-adjoint operator A has an empty essential spectrum iff (A—A)7! 
compact for some 2 € p(A), and in such a case, the resolvent is compact for 
any  € p(A). 


Hint: (a) o(A) has an accumulation point. (b) Use formula (3.10). 


Prove Weyl’s theorem: Suppose that A is a self-adjoint operator and C is Hermi- 
tean and compact; then oess(A+C) = Gess(A). 


Let A bea self-adjoint operator; then Hp, C H, and o(A} ti) = ora) Uose(A)). 
Moreover, both gac(A) and os-(A) are contained in gess(A). 


LetwhidR”™ — JR bedefined\by h(21,-..,2%n) := ja 12? ; then the corresponding 
multiplication operator T}, has a purely alseliniely Conieuons spectrum, o(T;,) = 
Cac(Th) — Oa5(Th) => Rt . 


Let A be a self-adjoint operator with a pure point spectrum. Using the notation 
of Example 5.3.11a, prove that any function y defined on ap(A) belongs to &(4) , 
the domain of y(A) consists of all 2 € H such that >>; |y(A;)|?(x, Pjx) < oo, and 
y(A)z = 0; (Aj) Px holds for any x € D(y(A)). In particular, yp € L°UR, dE) 
iff the sequence {p(A;)} is bounded; then y(A) = ¥); y(A;)P; in the strong 
operator topology, while in general the series does not converge with respect to 
the operator norm. 


Consider the operators P on L?(JR) and H := P?. 

(a) Given a bounded interval J Cc JR find a functional realization of Ep(J). 

(b) Find a functional realization of the spectral decomposition (ED), 

Prove: (a) A unitary operator U can be written as U = e*4U , where Ay := 
ja tdEy(t). A number A belongs to a = dA = e* for some s € a(Av) ; 


in particular, le o(U) iff BLY) #0 or Bw) _, #0 holds for any ¢ > 0. 


(b) Let A be a self-adjoint operator and U(s) := e7*4%, s € IR; then the resolvent 
can for any x € H and z€@'\ R be expressed as 


oo . 
Ra(z)a = +i [ e*7*17(+8)a ds, 
0 


with the upper and lower sign for Imz > 0 and Imz < 0, respectively. 
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32. Prove: (a) Let A be a self-adjoint operator and w : JR > JR a Borel function; then 
\ € dp(A) implies w(A) € op(w(A)), while the converse need not be true. 


(b) To any self-adjoint A there is a Hermitean operator A and a continuous real 
strictly monotone function f such that Ran f = R and A= f(A). 


33. Let A,A’ be commuting self-adjoint operators. Using the notation of Section 5.5, 
prove: 


(a) if fr : IR? — IR? is the transposition, fr(t,s) = (s,t), and P denotes the 
product measure of F and E, then the function yr := yofr belongs to Op 
iff ~ € ®p, and in that case, y(A, A’) = yr(A’, A), 


(b) complete the proof of Proposition 5.5.7, 


(c) let y € C(a(A) xa(A’)) ; then a(p(Ax A’)) = y(a(A) xo(A’)). In particular, 
the closure is not required if the operators A and A’ are Hermitean, 


(d) if w : RR? — IR? fulfils the assumption of Problem 15, then (yow)(A, A’) = 
yp(A1, A2) holds for any Borel y : IR? + €, where A; :=w;(A, A’), 


(e) et4eiAA’ — f(A.) holds for any \ € IR, where o(s,t) := s+t. Why can 
the right side not in general be written as e(4+4") 7 


(f) {A, AY = {BC) YN {FO)Y = (POY, 
(g) extend these results to the case of commuting self-adjoint A,,...,An. 


34. Let A bea positive self-adjoint operator; then its domain Dz, is a core for A!/?. 


35. Prove: (a) Any self-adjoint operator has a dense set of analytic vectors. 


(b) Suppose that the operators T and S are unitarily equivalent, T= USU-!, 
and 2x is an analytic vector of S ; then Uz is an analytic vector of T and 
rs(x) = rr(U2). 


(c) Let A be symmetric. Suppose that a subspace D C Dy is A-invariant and 
the analytic vectors of A are dense in D; then A is e.s.a. on D. 


(d) Find a symmetric operator A with C®(A) = {0}. 


Hint: (a) Consider the vectors E4(—n,n)x for an arbitrary x € H . (d) Restrict 
the operator P of Example 4.2.5 on L7(0,m) to {sp : k = 1,2,...}uin, where 
84(2) := (2/r)!/? sink. 


36. Each of the vectors hn given by (2.1) is an analytic vector of the operators Q and 
P on L(IR). 


37. Suppose that A € Lsa(H) and n>2; then |/A*a|| < ||A"x||*/"\|x||"-*)/" holds 
for all x € D(A") and any positive integer k <n. 


38. Let AS E Leslie), r= 12. 


(a) Prove Proposition 5.7.3. 
(b) op(A,) = op(Ar) ; if Ay has a pure point spectrum the same is true for A,. 
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39. 


40. 


at. 


42. 


48. 


44. 


(c) f(A,) = f(A1) @ Ig and f(Ay) = Ti @ f(Az) holds for an arbitrary real Borel 
function f ; the closure is not needed if f is bounded. 


Consider the operator (5.15) corresponding to given A,, Az. Prove 


(a) P(op(A1) x op(A2)) C o(P[A1, Ag]); the relation turns into identity if the 
spaces +, are separable and the spectra of the operators A, are pure point, 


(b) in the general case, the two sets need not be equal. 


Hint: (b) Consider the operator A; ® A2, where Aj is a nontrivial projection of 
finite codimension and A2:= Q on H2 := L?(R). 


Prove: (a) If V: H —H’ is a unitary operator and y € H is a generating vector 
of some A € Lsa(H), then y’ := Vy is a generating vector of VAV~!. 

(b) Suppose that A € Lya(H) has a simple spectrum and the operator Q, on 
L?(IR, du) is its spectral representation; then y(A) for any complex Borel 
function ¢ is unitarily equivalent to the operator Ty, on L?(IR, dy). 

(c) If y is a generating vector of A € Leq(H) and B € B(H) is such that BAC 
AB and Ran B=H, then the set { F4(J)By: JE 7} is total in H. 


Using Theorem 5.8.3, prove 


(a) the measure pz := (x, 4(-)x) for any x € H is generated by the function 
Wz := Vax and the measure p, 1.€., Ur(M) = fay |H2(t)|?du(t) holds for all 
M €B. In particular, any p-zero set is E4-zero, so L*(IR,du) c &4), 

(b) the generating vector y, for any ~ € L*(JR,du), belongs to the domain of 
w(A) and V-!w = W(A)y. This implies, in particular, that {W(A)y: vy € UV} 
is total in H for any total set VC L?UR, du). 


Let A € Loa(H) on a separable H have a pure point spectrum. Without reference 
to Theorem 5.8.6, prove that o(A) is simple iff {A}’ = {A}”. 


Let A € L4q(H) . Prove: 


(a) If there are y € H and and aset & c $4) such that y € D(y(A)) for all 
ye® and {y(A)y: ye ®} is total in H, then the spectrum of A is simple. 

(b) If A is bounded, then its spectrum is simple iff the set { A"y: n =0,1,...} 
is total for some y € Ht. 

(c) Suppose that there is y € H such that { By: Be {A}"} =H; then for any 
Hermitean C € {A}” and ¢ > 0 we can find a Hermitean operator B € {A}” 
such that ||(C—B)yl| <e. 


Hint: (b) Use (a) and Problem 41 for the set of all polynomials. (c) We have 
B* € {A}” for any B € {A}” and C—B is normal since {A}” is commutative. 


Let {U(s): s€ R} bea family of unitary operators on H fulfilling the group 
condition U(t+s) = U(t)U(s) for all t,s € IR. The map s+ U(s) is strongly 
continuous iff it is weakly continuous. If these conditions are valid, the group is 
operator—-norm continuous iff its generator is a bounded operator. 
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Let D be a dense subspace in H and suppose that a linear operator Upo(s) on 
D corresponds to any s € JR in such a way that Uo(s)D C D and Up(s)D =H; 
moreover, ||Uo(s)z|| = ||x{| holds for all x € D, the map s++ Up(s)z is continuous 
in IR, and Up(t+s)x = Uo(t)Uo(s)x holds for any t,s € IR. Denote by U(s) 
the continuous extension of Uo(s) ; then {U(s): s € IR} is a strongly continuous 
unitary group. 


Let A be a self-adjoint operator on . If a subspace DC Dag is dense in H and 
e*AD C D for all t€ RR, then D is acore for A. 
Hint: Use the argument from the proof of Theorem 5.9.2. 


Consider strongly continuous unitary groups {U(s) = e*4: se IR} and {U(s) = 
er: ge IR }. They are unitarily equivalent, i.e., there is a unitary operator V 
such that U(-)=VU(-)V—! if A=VAV-!. 


Prove the following generalization to Theorem 5.9.2: suppose that a unitary—valued 
map U : IR” — B(H) is strongly continuous and U(t+s) = U(t)U(s) holds for all 
t := [t1,...,tn] and s :=[s1,...,8n] € JR” ; then there are commuting self—adjoint 
operators Aj,...,An such that U(t) := et(4i+-+tAn) holds for all t € IR”. 
Hint: Use Corollary 5.9.4 and Proposition 5.5.7. 


It follows from Proposition 5.9.6 that U,(s) = e*4-, r = 1,2, holds for all se R. 


Prove that these relations in turn imply Ug(s) = e841 +42) . 
Hint: Use (5.16) and Problem 33e. 


Without reference to Example 5.5.1b, show that the translation operators U;(s)y := 
w(-+ 8) form a strongly continuous group which is generated by A; := P. 


Consider rotations of the plane, [x,y] + [vo, ys] with xo := rcos@—ysin@ and 
ye := xsin@+ycos@. Define the operators U,(8) on L?(UR?) by (U,(@)v)(a, y) := 
(xe, ye) and show that {U,(6) : 6 € R} is a unitary group, its generator A, is 
e.s.a. on Ch”) (IR), and (Agd)(a, y) = —i(eOy—yr)o(a,y) for all ¢ € CL UR?). 
Hint: Modify the argument of Example 5.9.5b. 


Chapter 6 


Operator sets and algebras 


Up to now we have been discussing properties of single operators or certain commu- 
tative operator families. Now we turn our attention to more complicated operator 
sets. An efficient way to study their structure is to use algebraic methods; this 
is the main topic of this chapter. A drawback of this approach is that the sets 
under consideration must be closed under algebraic operations, which is ensured 
automatically only if the involved operators are bounded; however, we already know 
that in many cases properties of unbounded operators can be studied through their 
bounded functions. Algebras of unbounded operators can also be introduced; we 
shall mention this topic briefly at the end of the chapter. 


6.1 C*-algebras 


Consider a topological algebra A, which is equipped with an involution — we refer 
to Appendices B.1 and B.2 for the necessary prerequisites. If the involution is 
continuous with respect to the given topology, A is called a topological *-algebra. 
If A is a normed algebra and |{a*|| = ||a|| holds for any a € A we speak about a 
normed *-algebra; the involution in A is obviously continuous. In a similar way 
we define a Banach *-algebra. 


6.1.1 Example: The algebra B(H) provides an illustration that the first of the 
above definitions is nontrivial. We know that B(H) is a *-algebra and at the same 
time a topological algebra with respect to 7, (cf. Example B.2.2). However, by 
Problem 3.9a the involution is not continuous in the strong operator topology unless 
dim H < oo. On the other hand, the weak operator topology makes B(H), as well 
as any *-subalgebra of it, a topological +*-algebra; algebras of this type will be 
discussed in Section 6.3 below. Furthermore, in view of Theorem 3.1.2a, B(H) is a 
topological *-algebra with respect to the operator-norm topology. Recall that by 
Example 3.2.3 it has an additional property, namely that ||/B*B|| = ||BI|? holds for 
any operator B € B(H). 


Inspired by the last mentioned relation, we can formulate an important defini- 


205 


206 CHAPTER 6. OPERATOR SETS AND ALGEBRAS 


tion. A Banach *-algebra is called a C*—algebra if 
\|a*al| = |||? (6.1) 


holds for all a € A. The example then says that B(H) with the operator-norm 
topology is a C*-algebra and the same is, of course, true for its closed *-subalgebras. 
Notice that the involution must be isometric if condition (6.1) is valid: we have 
\la||?= ||a*al| < |ja*|| |lall, 2-e., [lal] < |la*]|, and similarly |ja*|| < jal. 

Let us review the basic properties of C*-algebras; without loss of generality we 
may consider only those algebras with the unit element (see the notes). First we 
want to show that morphisms of C*-algebras are automatically continuous; this is 
a consequence of Proposition B.2.5a and of the following assertion. 


6.1.2 Proposition: Let y be a *-morphism of a Banach *-algebra A into a 
C*-algebra B; then ||y(a)|ln < |lal|4 holds for any a€ A. 
Proof: Suppose first that A has the unit element; then ¢(.A) is a subalgebra in 
B with the unit element ¢(e), so os(y(a)) C oy)(p(a)) holds for any a € A. 
Moreover, (y(a)—Ay(e))~!= y((a—Ae)~') exists provided a—Xe is invertible, which 
means that o,,4)(y(a)) C o4(a). In combination with Theorem B.2.4d, it implies 
the inequalities r(y(a)) < r(a) < |la||4. Let 6 be an arbitrary Hermitean element 
of B. By induction we get Wore” = ||b||s where m = 2” ; hence the limit n — 00 
yields r(b) = |lb|ls. 

By assumption y is a *-morphism, so the element y(a*a) is Hermitean and it 
is sufficient to combine the above results, 


lv(a)lls = lle(a*a)ls = r(p(a*a)) < |la*all4 < ja |lallalls = lal. 


If A or both algebras have no unit element, we can extend them to the Banach 

s-algebra A and the C*~algebra B, which have the unit elements. The assertion 
is valid for the *-morphism ¢% : (ae,+a) = aeg+y(a), and thus, a fortiori, for 
its restriction p= G}hA. ff 


By Theorem B.2.4d, the spectral radius in a Banach *-algebra depends on the 
element itself. In C*-algebras this result can be substantially strengthened. 


6.1.3 Theorem: Suppose that A is a C*-algebra and B its subalgebra containing 
the unit element of A; then og(b) = o4(b) holds for any element b€ B. 


The relation o4(6) C op(b) is obvious. To prove the opposite inclusion we need 
an auxiliary result. 


6.1.4 Lemma: Let A be a Banach algebra with the unit element. 


(a) If {a,} is a Cauchy sequence in the set FR of all invertible elements of A and 
@:=liMpodn € bdR:= R\R; then limpoo ||az!|} = oo 


(b) Let BC A be a closed subalgebra containing the unit element of A ; then 
bd (o8(b)) C o4(b) holds for all be B. 
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Proof: If (a) is not valid, there is K such that ||a;'|| < K for infinitely many values 
of the index n. At the same time, ||a,—a || < K~ holds for all sufficiently large n, 
so we can choose n such that |/e—az!a,|| = \laz!(a,—a)|| < 1. By Theorem B.2.4a 
and Proposition B.1.la, elements az'a and a are then invertible; however, this 
contradicts the assumption a € bd because the set R is open. 

Denote the sets of all invertible elements in A and B by Ry and Rez, respec- 
tively. They are open and Rg C Ry. For any b € bdRg we can find a sequence 
{b,} C Rg such that b, > b; if 6 belonged simultaneously to Ry, then b5! > b-! 
by continuity of the inversion, i.e., the sequence {||b;'|]} would be bounded. Since 
this contradicts assertion (a), we have R4Nbd Rg = @. Furthermore, we can easily 
check that 1 € bd(pg(b)) implies b—Ae € bd Ry, so b—Ae ¢ Ry ; in other words, 
A € o4(b). This means that o,(b) contains the set bd(pg(b)) ; however, the latter 
coincides with the boundary of the complement, bd(pg(b)) = bd(zg(b)). 


Proof of Theorem 3: The elemert a*a is Hermitean for a € A, so oy(a*a) C R 
by Problem 2b. Hence each point of the spectrum belongs to its boundary (with 
respect to the complex plane) and Lemma 4b gives 


op(a*a) C bd(og(a*a)) C o4(a*a) C op(a*a), 


i.e., op(a*a) =o 4(a*a). Next we want to check the same relation for an arbitrary 
element 6 € B. We choose  ¢ o.4(b) and denote a := b—Ae ; we have to show 
that \ ¢ op(b), i.e., a7! € B. This is true for c:= a*a; since it is invertible, c~! 
belongs to B. The element a; := (a*a)~!a* is contained in B and aja = e, and 
similarly, a, := a*(aa*)~! belongs to B and aa, = e. However, the inverse in B is 
unique so a;3=a,=a EB. & 


The structure of a C*—algebra simplifies considerably in the commutative case 
(see the notes; compare also with Theorem B.2.6). 


6.1.5 Theorem (Gel’fand—Naimark): A commutative C*-algebra with the unit 
element is isometrically *-isomorphic to the algebra C'(A) of continuous functions 
on a certain compact Hausdorff space A. 


This structural result can also be employed to solve various problems in noncom- 
mutative C*-algebras; we have to select a commutative subalgebra and represent it 
functionally. The simplest case corresponds to the situation when this subalgebra 
is generated by a single element a € A; in this way we can construct the func- 
tional calculus on C*-algebras in analogy with the results of Section 5.2. The same 
idea provides an alternative proof of the spectral theorem for Hermitean operators; 
references are given in the notes. 

Let us finally describe briefly how a new C*-algebra can be constructed from a 
given family of C*-algebras. There are several ways, namely 


(i) restriction to a closed *-subalgebra in a C*—algebra, 


(ii) factorization of a C*-algebra with respect to a closed »~ideal, 
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(iii) let {Ag : a € I} bea family of C*-algebras. We choose the subset A := 
{a= [ag]: da € Aa, |lalleo := SUPge ||@alle < 00 } in their Cartesian product; 
since any A, is a Banach space, A may also be regarded as a Banach space 
(see Section 1.5). In addition, if we introduce the operations of multiplication 
and involution componentwise, [aa}[ba] := [@aba] and [aq|* := [aj]; then A 
becomes a C*-algebra (cf. Problem 4), which is called the direct sum of the 
C*-algebras A,, a € I, 


(iv) consider finally C*-algebras A), Ap with the norms ||-||;, 7 = 1,2. A bilinear 


map @: A, x A, — A, where A is a C*-algebra with the norm ||- || , which 
satisfies the requirements (a; @a2)(b; ®@b2) = a1b) @aeb2, (a1 @a2)* = aj ay 
and |la1 @ ael| = |lai||:||aall2 for any a; € A; and such that the set {a1 @ae: 


a; € A;} is total in A is called a realization of the tensor product of A, and 
A2. One can show that such a realization always exists and it is essentially 
unique — cf. Problem 5 and the notes. 


6.2 GNS construction 


The properties of C*-algebras derived above do not exhaust the consequences of 
condition (6.1). Using it, for instance, we can derive various spectral properties for 
elements of C*-algebras analogous to those of bounded operators (Problem 2). This 
similarity is not accidental; we are going to show now that any C*-algebra can be 
represented faithfully in some B(H). 

First we have to introduce the notion of positivity. Let A bea C*-algebra with 
the unit element. An element a € A is said to be positive if it is Hermitean and 
a(a) C [0,00). This is written symbolically as a > 0, and moreover, a > 6 means 
a—b > 0. Aset P in a vector space is called a cone if the elements aa, a+b belong 
to P for all a,b€ P,a > 0. A cone is always a convex set: if a, 1—a are nonzero 
numbers, then aa+(1—a)a € P holds for any a,b € P. Positive elements have the 
following properties (see the notes). 


6.2.1 Theorem: Let A be a C*-algebra with the unit element; then ata > 0 
holds for all a € A, and conversely, any positive element b € A can be expressed 
as b= a"*a for some a€ A. The sets Ax := {a € A: +a > 0} are closed cones 
in A such that A,MA_ = {0}. 


A linear functional f ona *-algebra A is positiveif f(a*a) > 0 forall ae A. 
Let A be a Banach *-algebra; then a positive functional f on A normalized by 
| f|| = 1 is called a state (on the algebra A). Positivity of a functional is denoted 
as f > 0 while f > g again means f—g >0. If A is a C*-algebra, a functional 
f is positive, by Theorem 1, iff f(b) > 0 holds for all positive elements b € A. 


6.2.2 Proposition: Suppose that f is a positive functional on a C*~algebra with 
the unit element, and a,b are arbitrary elements of A; then 


(a) f=0 if f <0 holds at the same time, 
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(b) f(a*) = f(a), 

(c) the generalized Schwarz inequality is valid, |f(a*b)|? < f(a*a) f(b") , 
(d) |F(a)|? < Fle) f(a*a) < f(e)?r(a*a), 

(e) f is continuous and ||f|| = f(e). 


Proof: By assumption, f((a*+ @e)(a + ae)) > 0 ; choosing a = 1,1, we find 
that f(a)+f(a*) and i(f(a*)—f(a)) are real, i.e., assertion (b). Furthermore, (c) 
follows from the fact that [a,b] + f(a*b) is a positive symmetric sesquilinear form; 
substituting b =e we get the first inequality in (d). The element c := r(a*a)e—a*a 
is positive, so f(c) > 0, and since f is linear we have f(a*a) < f(e)r(a*a). Next 
we use (6.1) in combination with Theorem B.2.4d; it yields 


If(a)l? < fle)f(a*a) < f(e)*r(a*a) < f(e)"lla*al| < f(e)*llal|’. 


This proves (d), the continuity of f, and ||f|| < f(e) ; the opposite inequality 
follows from |le|] = 1. It remains for us to prove (a). By assumption f(a*a) = 0 
holds for any a € A; hence (d) implies f(a) =0. 


After these preliminaries, let us turn to the mentioned result concerning the 
existence of a faithful representation for any C*-algebra. The proof is based on a 
constructive method devised by I. Gel’fand, M. Naimark and I. Segal. 


6.2.3 Theorem (GNS construction): Let A be a Banach *—algebra with the 
unit element. For any positive functional f on A there is a Hilbert space H 
and a representation : A — B(H) with a cyclic vector % such that f(a) = 
(Wo, 7(a)wo) holds for any ac A. If {H’',7’, Wp} is another triplet with the same 
properties, then there is a unitary operator U € B(H,H') such that Yj = Uy and 
Un(a) =7'(a)U for all aE A. 

Proof: The starting point for the construction of H is the algebra A itself. The 
form y: Ax A— © defined by y(a,b) := f(a*b) is sesquilinear, symmetric, 
and positive. Hence we can use it to construct the inner product; however, since 
f(a*a) = 0 does not imply a = 0 a factorization is needed. By Proposition 2c, 
J; :={a: f(a*a) =0} is a subspace in A, and moreover, 


|F((ab)*ab)|? < f((a*ab)*a*ab) f(b*b) = 0 


holds for all a € A, b € Jy, so Jy is also a left ideal. On the factor space 
A/J; we define an inner product by (4,6) := f(a*b), where a,b are some elements 
representing the classes a,b € A/J;. Of course, (a,b) must not depend on the 
chosen representatives, z.e., f(a*b) must be zero if at least one of the elements a,b 
belongs to J; ; this follows from Proposition 2c and the relation f(a*b) = f(b*a). 
The form (-,-) is obviously sesqilinear, symmetric, and positive; (@,a@) = 0 holds 
iff &@= J+, which is the zero element of A/J;. In this way we have defined the 
inner product on A/J+ ; the Hilbert space H is then obtained by the standard 
completion of the factor space A/J7;. 
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Let us turn to the representation 1. We define the operator (a) on A/ i 
for any a € A by T(a)b := ab, where 6 is an element representing the class 6 ; 
this makes sense because J; is a left ideal in A. The operator 79(a) is obviously 
linear; let us check that it is bounded. We have 


ll70(a)b||? = (ab, ab) = f(b*a*ab) = yo(a*a) 


for any b € A/J;, where yo(c) := f(b*cb). The identity implies, in particular, that 
the functional y» is positive, so Proposition 2e yields 


II70(a)b||? < yo(e)|la*all.a < F(O*b)llall% = Ill? H1al12 . 


By Theorem 1.5.5, (a) has then a unique continuous extension to the space #1, 
which we denote as (a). The norm remains preserved, ||7(a)||s(1) < |lal|.4. Since 
A/J; is dense in H, to prove that 7(-) is a representation of A it is sufficient to 
check the relations 


r(aatbé=an(ale+n(b)é, m(ab\e=n(a)r(b)e, (é,2(a*)d) = (E,2(a)*d) 


for any a,b€ A, é,d€ A/J; and a€ C. The first two of these follow directly from 
the definition, and (é,(a*)d) = (@,a*d) = f(ctatd) = (G@,d) = (x(a)é,d) yields 
the third. Furthermore, 7(A)é = A/J; and A/J; is dense in H, so yp :=€é isa 
cyclic vector of the representation 7 and 


(Ho, r(a)yo) = (€,4) = flea) = f(a) 


holds for all a € A. Hence we have proven that there is at least one triplet 
{H,7,Wo} with the required properties; for the sake of brevity we speak about 
a GNS triplet (corresponding to algebra A and functional f) and 7 is called a 
GNS representation. 

Assume finally that {H’',72’,yo} is another GNS triplet. We denote Ho := 
T(A)to = A/F; and Ho := 1'(A)yG, and define the operator Up : Ho > Hg by 
Upr(a)yo := m'(a)y for any a € A; the relations 


[Vor (a)vollty = (Ho, ("alow = f(a*a) = |\r(a)%o|l? 


show that Up is bijective and norm-preserving. At the same time it is linear, and 
since the subspaces Ho, Ho are by assumption dense in H and H'’, respectively, 
it can, due to Proposition 3.3.3a, be extended continuously to a unitary operator 
U: H—H’. Substituting a := e into the last relation we get Uw = wi, and 
consequently, Um(a)Wo = 1'(a)Uyp. We apply the operator 1/(b) to both sides of 


this identity, 
t'(b)Un(a)yo = x'(ba)Uyo = Un(ba)yo = Ur(b)r(a)yo , 


and since m(A)jo is dense in H, it follows that 7’(b)U = Un(b) holds for any 
element be A. fi 
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6.2.4 Examples: (a) Let A be a closed subalgebra in B(H) containing the unit 
operator. Given an arbitrary vector 7 € H, we define the functional 


fet fu(B) = (w, By) forall B EA. 


It is obviously positive. Moreover, the subspace Az C H is invariant with re- 

spect to the operators of A, so Hy := Av is also invariant (Problem 1.33c). 

This allows us to define the map 7, : A — B(H,) by 7y(B) := By := 

B)Hy. It is not difficult to check that {Hy,7y,w} is a GNS triplet cor- 

responding to the functional fy. A particularly simple situation occurs if 

Hy =H; then zy is the identical mapping and w is simultaneously a cyclic 
« vector of the operator algebra A. 


(b) Not every positive functional on a C*-algebra AC B(H) can be expressed in 
the above form. To describe a more general case, consider a statistical operator 
W on H (ef. Section 3.6) with the spectral decomposition W = Th, weEk , 
where N := dim(KerW)+ and the E, are one-dimensional projections corre- 
sponding to normalized eigenvectors ~, with the nonzero eigenvalues w,, k = 
1,...,N; the latter fulfil the condition TrW = D%_, w, = 1. Using operator 
W we define the positive functional 


N 
fw: fw(B) = Tr(WB) = >) wel, By) 
ci 


on A. The subspaces Hy, := Avy, are again invariant with respect to the 
algebra .A ; we construct the Hilbert space Hw := @_, HW , where the spaces 
HY are obtained from H, by replacing the inner product with (-,-), := 
wz(-,:). In other words, the elements of Hw are sequences © = {¢,}i_, 
with ¢@, € He such that ||®||w < oo ; here the norm ||- ||w is induced by 
the inner product (®,V)w := SM, we(de, Ye). Given an operator Be A 
we define tw(B) := Bw on Hw by 


Bw{bihear = {Bo}ear 3 


it is easy to see that ||By|| < ||B||. If A contains the unit operator then the 
normalized vector Vw := {v}i, and (Vw, tw(B)Yw)w = fw(B) holds 
for any Be A. 


The map tw : A — B(Hyw) is a representation of algebra A. However, 
this does not mean that {Hw,7w, Vw} is a GNS triplet corresponding to 
fw, because Vy need not be a cyclic vector of mw. The algebra ('(H) := 
{al : a € €} of scalar operators can be taken as an example; the set 
tw(C€ (H))Yw = (Vw)un is clearly not dense in Hy unless N = 1. On 
the other hand, we shall show that Uy is cyclic for tw if the projections 
Ex, k = 1,...,N, belong to algebra A. To check this claim, suppose that 
(®, tw(B)Uw)w = hii welds, Bde) = 0 for © € Hw andall Be A. 
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Choosing B = CE, with C € A we get (¢n,C%n) = 0 for n=1,...,N ; 
however, the set Ay, is dense in H,, so ¢, = 0 for all n, and also ®=0. 
In this case {Hw,7w, Yw} is the sought GNS triplet corresponding to the 
functional fw. Notice also that the representation ty is reducible unless 
N = 1; it follows from the definition that mw(B) = @e, 7: (B), where 
7.(B) := B)H, for any BE A. 


(c) A GNS representation need not be faithful. To illustrate this claim, consider 
a nontrivial orthogonal-sum decomposition H := H; @ H2 and the algebra 
A := B(H;) ® B(He). If the functional fy, used above, corresponds to a 
nonzero vector ~ € H,, the GNS representation is given by 7y(B) = B, for 
any B := B, @ By because Hy = Hy. Its kernel Ap := {BE A: B, =0} 
is obviously nontrivial and forms a closed ideal in algebra A (not in B(H), 
of course!). 


Example 4c shows that Theorem 3 itself does not solve the mentioned prob- 
lem. In fact, we want to show that any C*~-algebra A has a representation 
nm : A — B(H), which is not only faithful but at the same time reproduces the 
metric properties. This requires the map 7 to be simultaneously an isometry; in 
that case we speak of an isometric representation. To construct such a represen- 
tation we need a rich enough family of positive functionals on A. Its existence is 
ensured by the following result. 


6.2.5 Proposition: Let A bea C*-algebra with the unit element. For any nonzero 
element a € A there is a positive functional f, such that fa(e) = 1 (so fy isa 
state on A) and at the same time, f,(a*a) = |la|l?. 


Proof is a modification of the argument used to prove the Hahn-Banach theorem 
(see the notes). First one constructs a functional f with the required properties on 
the set Ap of all Hermitean elements in A, which forms a real Banach space. We 
set f(ae+Ga"a) := a+ Glla*a|| on the subspace {e,a*a}jin and extend it by adding 
vectors. The most difficult part is to check that the positivity is preserved at each 
step; the procedure is completed by Zorn’s lemma. Having constructed f we define 


fa: fold) = f((b+b")/2)+if((b—b°)/2i). 


6.2.6 Theorem: An arbitrary C’*-algebra A has an isometric representation 7 : 
A — B(H) on some Hilbert space H. 


Proof: Without loss of generality we may assume that A has the unit element. By 
Proposition 5, we can associate a positive functional f, with any nonzero element 
a e€ A, and furthermore a GNS triplet {Ha,7a,%a} corresponds to fa. We define 
the Hilbert space in question as the direct sum, H := @ozac4 He ; recall that any 
vector {¢.: 0#a€ A} €H has at most countably many nonzero components. 
The representation 7 is constructed as the direct sum of representations 7, : we 
set 1(b)® := {7a(b)da : O 4 a € A} for any b € A. Proposition 6.1.2 implies 
I|a(b)|| < Holla, and therefore ||m(b)|| = supozac.a ||%a(b)|| < |lbll4, ae., the oper- 
ators 7(b) are bounded. It is easy to check that the map 7: A — B(H) isa 
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representation. It is also isometric: cyclic vectors 7%. are normalized by Theorem 3 
and Proposition 5, ||w,||2 = fa(e) = 1; hence 


Iro(O)II? > Ilto(b)dolls = (Yr, mto(b)*15(b)yr)o = fold") = [IZ 


In combination with the above estimate, this yields |b||4 < ||7.(})|| < |lr(0)|| < 
|bl|4 for any 6 € A, which concludes the proof. 


The representation z constructed in the proof is, of course, faithful but it does 
serve practical purposes; the space # is extremely large (recall that any vector 
has infinitely many components corresponding to multiples of a single element of 
A) and the representation 7 is “excessively reducible”. We usually employ other 
methods to construct isometric representations of C*-algebras; the importance of 
Theorem 6 is that it guarantees their existence. 

In conclusion let us mention another application of the GNS representation. 
Let A again be a C*-algebra with the unit element and let us denote by Sy, the 
family of all states on A. It is easy to see that S4 is convex: the functional 
f := af;+(1—a)fe is obviously positive for any f,, fo € S,4 and non-negative 
numbers a, 1—a, and ||f|| = f(e) = afi(e)+(1—a)fe(e) = 1, 2e, f is a state 
on A. In the notes to Sec.1.1 we have defined an extremal point of a convex set; 
the extremal points of Sy, are called pure states. The following important result is 
valid (see the notes). 


- 6.2.7 Theorem (Segal): Let A be a C*-algebra with the unit element. A state 
f € Sy is pure iff the corresponding GNS representation 7; is irreducible. 


6.3 W*—algebras 


Our next topic deals with algebras of bounded operators on some fixed Hilbert space 
H. In addition to the operator C*—algebras considered in the preceding sections, 
i.e., norm-—closed *-subalgebras in B(H) , we can in this case also define other classes 
of algebras due to the fact that B(H) can be equipped with different topologies. 


6.3.1 Remark: The topologies 7, Ts, Tw of Section 3.1 are not the only ones which 
can be defined on B(H). We shall describe two more topologies. Let H be the set 
of all sequences © = {¢,} C H such that 5; |l¢xl|? < 00. We set 


po(B) := (3° 126.1") ‘i 


for all @ € H and B € B(H). It follows from the Minkowski inequality that 
pe : B(H) — (0,00) isa seminorm, and it is easy to check that the family {ps}. := 
{pe: PE H} separates points. The corresponding locally convex topology on 
B(H) is called o-strong. In a similar way, we define the seminorms 


uh) lee Bee) 
k=1 
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and the system {pey}ow := {paw : &, VE H} determines the o-weak topology on 
B(H). Both definitions are conveniently reformulated if we regard H as the direct 
sum H = Oe, He, with H, =A, =...=H equipped with the inner product 


(6, = Set). 
k=1 


If we use B to denote the direct sum of identical copies of operator B, BO := 
{Bo}, then the seminorms are expressed as pe(B) = ||BP\|q and pay(B) = 
|(®, BV)»|. The five topologies we have introduced on B(H) are related by 


U U 
T, D Ty 


where all the inclusions are nontrivial unless dim < oo (see Problem 6 and the 
notes). The indices appearing here will be used in the following to distinguish these 
topologies, e.g., (S)ow will mean the o—weak closure of set S, etc. 


After this preliminary, let us introduce a new class of operator algebras. For 
a set S C B(H), the definition of the commutant from Section 5.5 simplifies to 
S':= {B € B(H): BC =CB,C € S}, and we introduce the bicommutant by 
S" := (S’)’. One should keep in mind that these definitions are related to B(H) ; 
the commutant of a subset S in an operator algebra A introduced in Appendix B.1 
is in the present notation equal to S’M A. If S is a symmetric subset in B(H), 
then S’ and S” are by Proposition B.1.3f »*-subalgebras in B(H). Hence the 
following definition makes sense: a *-algebra AC B(H) is called a W*—algebra 
(or a von Neumann algebra) if it coincides with its bicommutant, A= A”. 

Though the notion is introduced in a purely algebraic way, W *-algebras have an 
equivalent topological characterization. By Proposition B.2.1c and Example B.2.2, 
a W*-algebra A is closed with respect to weak operator topology. Moreover, taking 
into account the relations between the topologies on B(H) and Example 6.1.1, we 
arrive at the following result. 


6.3.2 Proposition: Any W*-algebra A C B(H) is weakly closed. As a conse- 
quence, it is closed in all the other topologies of Remark 1; in particular, it is a 
C*-algebra. 


It is natural to ask whether an arbitrary weakly closed *-subalgebra in B(H) is 
at the same time a W*-algebra, or whether this is true for any operator C*-algebra. 
In both cases the answer is negative. 


6.3.3 Examples: (a) Let E € B(H) be a nontrivial projection. The algebra 
A := Ao({E}), which consists of all multiples of E, is weakly closed, but it 
does not contain the scalar operators which belong to Aw({E}) (Problem 8). 
Here Aw(S) means the smallest W*-algebra containing S” ; it is easy to see 
that A has to be completed with multiples of the unit operator. 
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(b) Let Q be the operator of Example 3.2.2 on a finite interval J := [a,}]. In view 
of Theorem 5.2.2, Ac := { f(Q): f € C(J)} is isometrically +*-isomorphic 
to the functional C*-algebra C(J) ; hence it isa C*-subalgebra in B(L?(J)). 
To check that it is not weakly closed, we have to choose a suitable sequence 
{gn} C C(J), €.9., gn(x) := max{1,((r—a)/(c—a))”} for some c € (a, 0). 
It follows from the already quoted Theorem 5.2.2 that w-limptogn(Q) = 
X{c(Q) , so this operator belongs to (Ac)w but not to Ac. 


We therefore want to find a necessary and sufficient condition under which a 
weakly closed *-algebra is a W*-algebra. We notice first that we can associate 
with any set S C B(H) a projection Es such that BEs = B holds for all BES 
(Problem 8c). Set S is called nondegenerate if Es = I ; this is true, in particular, 
if S contains the unit operator. 


6.3.4 Theorem: Let S be a symmetric subset in B(H) ; then 
(a) As :={BeS"”: B=EsBEs} is a weakly closed *-subalgebra in B(H), 


(b) the algebra A,,(S) := (Ao(S)). coincides with the o-strong closure (Ao(S))as , 


(c) Au(S) = 


Proof: The set As is obviously a *-subalgebra in S”. If B € (As) ; then for any 
- €>O0 and ¢,v~ € H there is C € Ag such that B—C is contained in the weak 
neighborhood W.({¢}, {w, Es, w}) of zero (cf. Section 3.1). The relation CEs = C 
then implies 


I(¢, BU-Es)#)| < \(¢, (B—C)#)| + (6, (B-C)Esy)| < 2€, 


so (¢, BUI—Es)w) = 0 for all ¢,f € H, i.e., B= BEs. In the same way we prove 
B= EsB, which together gives assertion (a). 

The most difficult part of the proof is the inclusion As C (Ao(S))os. In view 
of Problem 7, it is sufficient to show that any o-strong neighborhood U,(C;®) of 
an operator C' € As contains some operator B € Ap(S). We use the notation of 
Remark 1 and define E as the projection onto the closure of the subspace Mg := 
{ B® mB ew} G He. We prove the stated inclusion we have to check that 
EC® = C®, i.e, C® € Mg, since then for any € > 0 there is an operator 
Bie) € Ao(S) such that pg(B(e)—C) = \|B(e)® — CO x <é. 

The set Mg can be written as Ao(S)®, where Ap(S) := {B: B € Ao(S)}. 
Thus Ao(S)Ms C Me and since the operators in question are bounded, we also 
have Ao(S)Me C Me by Problem 1.33c, i.e, EBE = EB for all B € Ap(S). We 
can easily check that set Ao(S ) is symmetric; hence the same identity also holds for 
B* , which implies BE = EB. In addition, EB = B® follows from the definition 
of ihe projection E, so BE® = EB® = B® for any Be Ao(S). In other words, 
all operators of set sighs ) are reduced by the projection E. 

Next we introduce the projections E, : E,.® = (One, OpdpOmon}erthey 
satisfy the condition °72, E, = I , where the convergence os this series and similar 
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ones is understood. with respect to the topology 7,. Given an operator Ae B(H) 
we define Aj; := E,AE; ; using the ———- isomorphisms V;: E;H — H we can 
associate the operators Aj; := V;Ai;V;* € B(H) with these sere elements”. The 


relation EBS = B® implies E,.B(I—E)® = 0 for all k = 1,2,..., and therefore 


0= >). E,BE(I-E)E;® = De, bri Brte(5ij Ej — Big) ® 
7— 


1 tj=l 


because B is by definition reduced by all the projections E,. Summing over i and 
using operator V,, we put the last relation in the form B(¢x — 72, Exj$;) = 0. 
Since it is true for any B € Ao(S), we get 


dr — >. Exjoj € () KerB c () KerB = KerEs. 
j BEAo(S) Bes 


Operator C’ belongs by assumption to As, so C = CEs and the above relation 
implies C(¢%— 2, Exj;) = 0. Moreover, the operators B € Ao(S) commute 
with £, and at the same time, with all E,. Hence 


Buk = E,BEE; = E,EBE; — E,;B;;, 


and a unitary operator V; again we arrive at BE,;; = E,;B for all i,j = 
1,2,.... It follows that Ej; € S’, and since operator C_ belongs to S" we have 
a = Pye Eej3C'; . This is cnssiciilans: to E,.C® = E,EC¢, and summing over k 
we arrive at the sought relation EC =C®. 

The rest of the proof is easy. By Problem 8c, B = EsBEs holds for any 
B€S, and therefore also for all B € Ap(S). At the same time, Ap(S) C S” so 
Ao(S) C As. Due to (a), the algebra As is weakly closed; then the above result in 


combination with Remark 1 gives (Ao(S))w GC As C (Ao(S))os C (Ao(S))v. Hi 


The answer to our question can be derived easily from the proved theorem. If 
a weakly closed *-algebra A contains the unit operator, we have Ay = A”, so 
(Ao(A))w = (A)w = A. On the other hand, A= A” implies J € A because the 
unit operator belongs to the commutant of any subset in B(H). Hence we have 


6.3.5 Corollary: A weakly closed *-algebra AC B(H) is a W*-algebra iff it 
contains the unit operator. 


The second question posed above also has a simple answer: in view if the rela- 
tions between the topologies an operator C*-algebra is a W*-algebra if it is weakly 
(which is the same as o~strongly) closed and contains the unit operator. A related 
problem, namely under which conditions an abstract. C*-algebra can be represented 
isometrically by a W*-algebra, is much more complicated (see the notes). The dif- 
ference between the two classes is illustrated by the fact that W*-algebras contain 
“sufficiently many” projections; recall Example 3b which shows that an operator 
C*-algebra may contain trivial projections only. We denote by A® the set of all 
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projections in an algebra A C B(H). It contains clearly the unit operator, so 
A,,(A®) is a W*-algebra. 


6.3.6 Proposition: A,,(A®) = A. 

Proof: By Problem 8a the left side is equal to (A%)”. The inclusion A? Cc A 
implies A® c (A*®)” c A" = A, so an arbitrary projection belongs to A iff 
it is contained in (A*®)”. Next we use Lemma 6.5.3 which will be proven below; 
it claims that a weakly closed algebra with the unit element contains a Hermitean 
operator A iff it contains all projections of its spectral decomposition {E;,}. Hence 
{E:} c A® c (A®)”, and using the lemma once again we get A € (A®)”. Since 
any B € A is a linear combination of two Hermitean operators, the assertion is 
proved. §f 


In the rest of this section we are going to describe how W*-algebras can be 
classified. ‘To this end, we have to introduce several new notions. An algebra A is 
called a factor if its center Z,4 := AMA’ consists of scalar operators only; simple 
examples are B(H#) itself and the algebra C(H) of scalar operators. A W*—algebra 
satisfies the identity A= .A” by definition, so if A is a factor the same is true for 
A’. Notice that a factor may be reducible as an operator set (cf. Section 6.7 below); 
it is important that it is not reduced by a projection belonging to A. The opposite 
extremum is represented in a sense by a commutative algebra, which is equal to its 

center and reduced by any of its projections. 

F Let E be a projection in B(H). For any B € B(H) the operator EBE is 
fully determined by its part in the subspace EH, which we denote as Bz ; this 
allows us to associate the operator set Sz := {Be : B € S} with any Sc 
B(H). In particular, we have (Ag)’ = (A’)e provided A is a W*-algebra and 
Ee€EA (cf. Problem 10). It follows that Ag is a W*-algebra; it is called the 
reduced W*-algebra (corresponding to projection EF). A projection E € A is said 
to be Abelian (with respect to algebra A) if the reduced algebra Ag is Abelian; 
in particular, any minimal projection in A is Abelian (Problem 11). Projections 
E,F froma W*-algebra AC B(H) are called equivalent if A contains an operator 
U€é4A such that U*U = E and UU* = F, 1.e., a partial isometry with the initial 
space EH and the final space FH. Projections E,F are equivalent with respect 
to B(H) iff they have the same dimension; in general the identity of the dimensions 
is a necessary but not sufficient condition for equivalence of the projections. 

Simple properties of W*-algebra *-morphisms are listed in Problem 12. An 
important class of maps between W*-algebras AC B(H) and Bc B(G) consists 
of spatial isomorphisms: a *-isomorphism y: A — B is called spatzal if there is 
a unitary operator U € B(H,G) such that y(B) = UBU™ holds for all Be A. 
In such a case y is at the same time a spatial isomorphism of algebras B(H) and 
B(G) , and it represents a bijective correspondence between the centers of algebras 
A,B as well as between their commutants, y(A’) = B’ ; it is also obvious that 
dim H = dimG. This last relation means, in particular, that not every isomorphism 
of W*-algebras is spatial; a simple example is provided by the natural isomorphism 
y: C(H) —C, y(al) =a, for dimH > 1. 
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Next, consider a family of W*-algebras A, € B(H.), where a runs over an 
index set J. Their direct product introduced in Section 6.1 can be identified with 
an *-operator algebra on H := @acy Ha if we associate the operator B: Blda] = 
[Bada] with any [B,]. This algebra has the following properties (Problem 13). 


6.3.7 Proposition: The norm of B equals ||B|| = ||[Bal|| = suPaey ||Balle- 
we denote A := te -shi and B := @acyA),, then A’ = B and B= A. _ 
projections FE, := [6ael,], where Ig is the unit operator on Hg, are contained 


in A for all Be € J; an operator C € B(H) is contained in @gcyB(Ha) iff it 
commutes with all E,. 


This implies, in particular, that A := @acyAa is a W*-algebra; we call it 
the direct sum of the W*-algebras Aa. By Problem 12, the center of A is Z4 = 
@acy ZA, - On the other hand, if the center of a W*-algebra A contains a complete 
system of projections, we can express A as the direct product of the corresponding 
reduced algebras (Problem 14). 

Finally, consider W*-algebras A; C B(H;), 7 = 1,2. The W*-algebra gen- 
erated by the set A, x A2 := {B; ® B,: B; € A;} is called the tensor product 
of the two algebras and is denoted as A; @ Az. The set A; x A contains the 
unit operator, so A; ® Ap = (A; x A2)” follows from Theorem 4. Notice also that 
since A;, Aj are at the same time C*-algebras, we have to compare the present 
definition with that in Section 6.1. There, the realization of the tensor product in 
B(H, @ H2) is also constructed as a closure of the algebra A := Ao(A; x Ao). 
However, in general only the inclusion (A)~ D (A)y is valid; hence the two tensor 
products are mutually different; some authors use the terms tensor C*-product and 
tensor W*-product to distinguish them. 

After these preliminaries, let us describe how W*-algebras can be classified. 
Since the corresponding proofs are not needed in the following, and are for the most 
part complicated, we limit ourselves to formulating the results; the appropriate 
references are given in the notes. In the rest of the section the symbol A always 
means a W*-algebra in some B(H). 

We call A a type I (or discrete) algebra if it is «isomorphic to a W*-algebra 
with an Abelian commutant. For instance, A := ((H) is type J because it is 
*-isomorphic to the algebra € , though A’ = B(H) is noncommutative unless 
dim#H = 1. In the particular case when A is a discrete factor, a more detailed 
classification is possible. 


6.3.8 Proposition: Let A bea factor; then the following conditions are equivalent: 
(a) A is discrete, 
(b) A contains minimal projections (cf. Problem 11), 


(c) there are Hilbert spaces Hi,#2 such that A can be identified with the tensor 
product B(H,) ® C(H2), 
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(d) there exist a Hilbert space G and an index set J such that A is spatially 
isomorphic to the algebra A:= {B: Bl¢a] = [Bdoa], B € B(G)} on the 
Hilbert space @yesG. 


Hence any discrete factor is *-isomorphic to algebra B(G) for some Hilbert 
space G ; we classify these factors according to the dimension of G to algebras of 
type In, n= 1,2,..., and type Ico. 

This classification extends to a wider class of discrete algebras. A W*-algebra 
is called homogeneous if it contains a family { E,: a € J} of mutually orthogonal 
and equivalent Abelian projections such that ye; Ea = I ; such an algebra belongs 
respectively to type J, or J. according to the cardinality of index set J. Correctness 
of the definition follows from the fact that any two such families of projections have 
the same cardinality. Any homogeneous algebra is discrete. On the other hand, 
any discrete factor A is homogeneous (Problem 15), so its type is defined in two 
different ways. It is easy to see, however, that the two definitions are equivalent: A 
is *-isomorphic to algebra B(G) in which the corresponding family consists of the 
one-dimensional projections referring to vectors of some orthonormal basis; thus its 
cardinality is dimG. 

Commutative algebras are particularly simple. In such a case the unit operator is 
an Abelian projection with respect to A ; hence any Abelian W*~algebra is discrete, 
homogeneous and of type /;. On the other hand, for general discrete algebras we 
have the structural result according to which any such algebra is spatially isomorphic 
to a direct sum of homogeneous algebras belonging to mutually different types: 


6.3.9 Proposition: In an arbitrary discrete W*-algebra A we can find a system 
{ E;: 7 € J} of mutually orthogonal projections such that A can be identified with 
the direct sum @je;Az,. Moreover, every reduced algebra Ag, is homogeneous 
and of type In,, nj < 00, and nj # mn, for any pair j,k EJ. 


Continuous algebras are the complement to the class of discrete W*-—algebras. 
A W*-algebra is said to be continuous if its center contains no nonzero projection 
FE such that the reduced algebra Ag is discrete. It is clear from the definition 
that any factor is either discrete or continuous. An important feature of continuous 
algebras is that they contain no minimal projections. 


6.3.10 Proposition: An algebra A is continuous iff any projection E € A can 
be expressed as a sum of two orthogonal equivalent projections. 


It also can be proved that any algebra reduced from a discrete (continuous) 
algebra is respectively discrete or continuous; hence a W*-algebra cannot be discrete 
and continuous at the same time. This does not, however, mean that a W*—algebra 
should be either discrete or continuous; this is true only for factors. In the general 
case we have the following structural theorem. 


6.3.11 Proposition: Any W*-algebra A contains orthogonal projections FE, F' ¢€ 
Za, E+F =I, such that Ag is discrete and Af continuous. In other words, A 
is spatially isomorphic to a direct sum of a discrete and a continuous algebra. 
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To get a more detailed classification we introduce traces. Let A; be the cone 
of positive operators in A; then a trace isa map 7: A, > IR*, A, C A,, with 
the following properties: 


(i) r(aB+C) = ar(B)+7(C) for all B,CEA,,a EC, 
(ii) r(UBU~!) = r(B) holds for all B € A, and all unitary operators U € A. 


In adition, we set 7(B) = oo for Be A\ A,. Condition (ii) is equivalent to the 
requirement 7(B*B) = 7(BB") for all Be A. A trace 7 is called finite if A, = 
A, , and semifinite if for any nonzero B € A, there is a nonzero positive operator 
CeéA, such that C < B. A trace 7 is said to be faithful if 7(B) = 0 implies B = 
0, and normal if for any sequence {A;} C A,, such that A := w-limpico D521 Aj € 
A, exists, the relation (A) = (72, 7(A;) is valid; an alternative definition will be 
given in the next section. If H is separable, for instance, then the map 7: 7(B) = 
Tr B is a faithful normal trace on B(H) (Problem 16). Its domain is B(H), = Ji, 
so T is finite if dim# < oo and semifinite otherwise. 

An algebra A is said to be finite (semifinite) if for any nonzero B € A, there is 
a finite (respectively, semifinite) normal trace tg on A such that 7(B) #0. On the 
other hand, if there is no finite (semifinite) normal trace on A, the algebra is called 
properly infinite (respectively, purely infinite). It follows from Proposition 8 and 
Problem 16 that any discrete factor is semifinite. More generally, discrete algebras 
are semifinite; in particular, any algebra of type J,, where n is a positive integer, is 
finite. Continuous W*-algebras are classified in the following way. If A is semifinite 
it is of type IJ. If, in addition, it is finite we speak about about type JI, , while 
a continuous A which is semifinite and properly infinite belongs to type II,,. A 
purely infinite algebra is always infinite; we call it a type JJ algebra. One can prove 
in a constructive way that all the listed types exist. 

Bach continuous factor belongs to just one of the types IJ,, IJ,., IJI, and since 
any factor is either discrete or continuous, we have the following result. 


6.3.12 Theorem: Any factor belongs to just one of the types I,,n = 1,2,... , Is, 
Mie Long Tt . 


’ 


A general W*~algebra can be divided, from the viewpoint of finiteness, in a way 
similar to Proposition 11. 


6.3.13 Proposition: In the center of any W*-algebra A there are pairs of mutually 
orthogonal projections E,,F, and E2, F, such that EF, < E> and Beek; =e, 
and the reduced algebra Ag, (respectively, Ax,, Ar,, Ar, ) is finite (respectively, 
semifinite, properly infinite, purely infinite). 


Combining these results we arrive at a complete classification of W*-algebras. 


6.3.14 Theorem: Any W*-algebra is spatially isomorphic to a direct sum of alge- 


bres-efuypes”/,,, w=ie2,... , lag, Maa TEs ; each type is contained at 
most once in the direct sum. 
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6.4 Normal states on W*—algebras 


Any state f ona W*-algebra A C B(H) is additive by definition; in particular, 
f(Eit---+En) = f(Fi) +---+f(E,) holds for orthogonal projections E,,..., En. 
On the other hand, a similar relation may not be valid for an infinite family of pro- 
jections. Strengthening the additivity requirement we obtain an important subclass 
in the set of all states: a state f on A is called normal provided 


j (x E.| = Yo f(Ba) (6.2) 


ae T aed 


halds for any family { F. : a € J} of mutually orthogonal projections in A, and 
furthermore, a nonzero positive functional g on A is said to be normal if the state 
I fl|-1f is normal. 

Relation (6.2) needs a comment. If index set J is at most countable its meaning 
is clear: we arrange projections EK, into a sequence and define Soucy; Fa as the limit 
of partial sums and the right side as the sum of the corresponding series; the numbers 
f(£q) are non-negative, so the order of summation is not important. In the case 
of a nonseparable H the set J may be uncountable; then >.<; Fa is given by 
Proposition 3.2.13. On the other hand, motivated by the countable case, we define 
the right side of (6.2) as supxes Lack f(Ea), where S C 2” is the family of all 
finite subsets in J, i.e., as a limit of the net of finite partial sums (see also the 
notes). 

Our main aim in this section is to derive conditions which specify the subset of 
normal states in the set of all states, and to find a general expression for a normal 
state. The result, the proof of which will proceed through a series of lemmas, can 
be formulated as follows. 


6.4.1 Theorem: Let f be astate ona W*-algebra AC B(H) ; then the following 
conditions are equivalent: 


(a) f is normal, 
(b) f is o—weakly continuous, 


(c) there is a statistical operator W € Z% such that f(B) = Tr(WB) holds for 
all BEA. 


6.4.2 Remark: A statistical operator on a separable Hilbert space 7 was defined 
in Section 3.6 as a positive trace-class operator W such that TrW = 1. This 
definition naturally extends to the case of a nonseparable 7, where the operator 
W is required to be positive with (KerW)+ = RanW separable and such that 
the nontrivial part of W, i.e., the restriction W}(KerW)+ satisfies the stated 
conditions. 


6.4.3 Lemma: A functional f which is the limit of a sequence of o-strongly 
continuous linear functionals on a W*-algebra A is o-strongly continuous. 
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Proof: In view of Proposition 6.3.2, it is sufficient to check that f is o-strongly 
continuous on the unit ball A; := {B € A: ||Bl] < 1} (see the notes); this 
follows from the estimate |f(B)| < |fn(B)| + l|f—fnll || Bl] in combination with the 
assumption ||f—fnl| +0 asn—-oo. Ff 


6.4.4 Lemma: Let f be a linear functional on a W*-algebra A C B(H) ; then 
the following assertions are equivalent: 


(a) there is a sequence © = {¢,} € H (in the notation of Remark 6.3.1) such that 
|f(B)| < pe(B) holds for any BE A, 


(b) there are sequences ©, V € H such that the relation f(B) = DS, (de, Bux) 
holds for all Be A, 


(c) f is o—weakly continuous, 
(d) f is o-strongly continuous. 


Proof: It is sufficient to check the chain of implications (a) > (b) > (c) > (d) => (a). 
The intermediate two are simple. To prove the last one, assume that (a) is not valid; 
then for any ® € H there is an operator Bs € A such that |f(Bg)| > || Ba®|lx. 
By Problem 7, the o-strong continuity implies, in particular, that we can find 6 > 0 
and a nonzero vector V € H such that |f(B)| <1 holds for all B € A fulfilling the 
condition BU <6. The last inequality is valid, e.g., for C := || Be®||=" Be with 
® := 2. This is, however, impossible because then f(C) = ||BaPl|z Lf ( Bait <7 
in ‘cniseilintinn to the assumption. 

It remains for us to check that (b) follows from (a). We define the linear func- 
tional go : go(B®) = {(B) on the subspace {B&: Be A} CH; the definition 
is correct since B # C implies B #4 C. By assumption, |f(B)| < |BO|]5 holds 
for all B € A, so the functional go is bounded; in view of Proposition 1.5.7a we 
can extend it to a bounded functional g: H — (. Finally the Riesz lemma gives 

g(®') = (U, &), for some V € H and all © € H; in particular, f(i)= g(B&) = 
(W, Bo), for any B € A, which is just assertion (b) up to the interchange of ® 
and Vv. § 


6.4.5 Lemma: Let f be a positive functional on a W*-algebra A C B(H) ; then 


(a) if f(B) < (¥, By) holds for some vector ~ € H and all B € A,, then there 
is an operator C € A’ such that f(B) = (Cy, BCy) for all BE A, 


(b) if f expresses in the form f(B) = (¢, Bw) for some ¢,% € H, then there is 
a vector x € H such that f(B) = (x, By) forall BEA. 


Proof: We define the form g on the subspace Ay by 9(Byq), Bow) := f(B* Bo) 
for all B,, Bz € A. This makes sense: it follows from the assumption and Proposi- 
tion 6.2.2c Sg If(C*B)?? < f(B*B)(p,C*Cy) , so f(C*B) =0 forany B,CE A 
if Cy = 0 ; the same is true for f(B*C). The form g is obviously symmet- 
ric and positive, and the Schwarz-inequality argument shows that it is bounded, 
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\g( Bis, Bow)| < || Bit)||?|| Bow ||?. Hence it can be extended to a bounded form on 
Av C H and by Proposition 3.1.1 there is a positive operator A, € B(Ay) such that 
f(BiB2) = (Bi, AgBow). Then we also have (B,y~, A,B3;BoW) = f(BiB3Be2) = 
f((B3B1)* Bz) = (B3 By, A, Bow), i.e., 


(A, B3—B3A,) \ Ad = forall Bg € A. 


Thus A := A,@0, where 0 is the zero operator on (A7))*, is positive and belongs 
to A’. By Proposition 3.2.6 the same is true for C := A’/? and f(B) = (~, AB) = 
(wb, C? By) = (Cy, BCy) for any BE A. 

To prove part (b), we use the parallelogram identity which yields 4f(A) = 
2(¢, Ay) + 2(, Ad) = (+4, A(O+4)) — (6-Y, A(G—Y)) for any Ae A. Hence 
4f(B) < (6+, B(¢+y)) holds if B is positive, and by (a) there is an operator 
CeéA’ such that f(B) = (x, Bx), where x:= $C(¢+y). OE 


6.4.5 Lemma: Let f,g be normal positive functionals on a W*-algebra A C 
B(H). If f(E) < g(E) holds for some projection EF € A, then A contains a 
nonzero projection F' < & such that f(B) < g(B) for all operators B € A which 
satisfy the condition O< B<F. 

Proof: Consider the family of projections M:= {P< E: f(P) > 9(P)}, which is 
partially ordered by operator inequalities. Let N be a completely ordered subset 
in M. If it is countable we can write it as a nondecreasing sequence {P,} ; by 
Theorem 3.2.12 it converges strongly to some projection P. The functionals f,g 
are supposed to be normal, so f(P) = limpioo f(Pr) = limpeo 9(Pa) = 9(P), te., 
the set N has an upper bound. The same is true for an uncountable N when the 
sequence has to be replaced by a net of projections. Zorn’s lemma then implies the 
existence of at least one maximal element Ep. 

The projection F := E—Ep is nonzero since otherwise f(F) = f(Eo) > g(o) = 
g(E). Assume that A contains an operator B such that 0< B< F and f(B)> 
g(B). We denote its spectral measure by Fa(-) ; due to Proposition 6.3.2 and 
Lemma 6.5.3 which we shall prove below, the algebra A then also contains the 
projections Ep(J) for any interval J C JR. We shall check that for some of these 
projections the inequality f(Es(J)) < g(Ea(J)) cannot be valid. In the opposite 
case, we could choose a suitable sequence {By}, say 


Wk k-1k 
a Oe (I: 
dW ‘ N’N 


which approximates B in the operator norm in view of Theorem 5.2.2; we use the 
fact that the spectrum of B is contained in the interval [0,1]. The functionals f,g 
are bounded by Proposition 6.2.2e, so f(B) = limnoo f(Bn) < limyso g(By) = 
g(B) in contradiction with the assumption. This means that there is at least one 
nonzero projection FE, := Eg(J,) corresponding to some J, C [0,1] such that 
f(F:) > 9(). 

The inequalities 0 < B < F imply 0 < ||B¥/?y||? < ||Fy||? for any ~ EH, te., 
Ker F Cc Ker B and Ep(R\ {0}) < F. Then FE; := Ep(J1) < Es(R\{0}) < F so 
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E, < F < E, and furthermore, EyE, = (E-F)E; =0. This means that Fo+ Fy 
is a projection which should belong to M ; however, this is impossible since Ep is 
amaximalelementin M. [& 


Proof of Theorem 1: The most complicated part is to check that (a) implies (b). 
Given a projection E € A we define fe(B) := f(BE) and denote M:={E eA: 
fe is o-strongly continuous }. Let {E;} Cc M be a sequence of mutually orthog- 
onal projections; we denote P, := i» 4 E; and P :=s-lim,p..P,. The inequality 


|f(BE)? < f(B"B)f(E) < f(E)||BI!? 


holds for any projection E € A and for all B € A; choosing E := P—P, we 
get ||fp—fp,|| < f(P—P). The state f is normal by assumption, so we have 
limgooo f(P—P,) = 0, and therefore fp is o-strongly continuous, 7.e., P belongs 
to M. This argument easily extends to the situation when {P,} is replaced by a 
nondecreasing net of projections {P,}: the projection P := sup, P, again belongs 
to M. It now follows from Zorn’s lemma that the set © contains at least one 
maximal element Ep. 

Suppose that the projection E,; := I— Eo is nonzero. We choose a vector 
@ € E,H, ||d|| > 1, and define the functional gs : gs(B) = (¢.Bo) which is 
obviously positive and normal. We have f(F,) < ||Fill < wil? = 96(E:) ; hence 
by Lemma 6, A contains a nonzero projection E2 < FE, such that f(B) < (¢. Bo) 
holds for all B € A with 0 < B < Ey. The operator B := ||C||-?E.C*CE, for 
any nonzero C € A obeys this condition, so ||C||~?f(ExC*CE2) < ||C||-?||CE.6||* . 
and furthermore, 


If(CE2)? < f(1)f(ExC*CEy) < ||CEx¢|l. 


Thus the functional fg, is strongly continuous, and therefore also o-strongly con- 
tinuous. By assumption, E2E) = E2(1—E,) = 0, ie., E+ Ep is a projection 
which should belong to M , but this contradicts the fact that the projection Eo is 
maximal. In this way we get Eo = J, which means that the functional f itself is 
o-strongly continuous, and by Lemma 4 it is at the same time o-weakly continuous. 

Next we are going to show that (b) implies the existence of a vector ® € H (in 
the notation of Remark 6.3.1) such that ||| = Df, |lde|]? = 1 and 


= 3 (eB) (6.3) 


for an arbitrary B € A. First we define the functional f onthe W*-algebra A:= 
{B: BE A}c B(H) by f(B) := f(B). In view of Lemma 4, f(B) = (W’, BV) 
for some WV,’ € H, and since f is positive, Lemma 5 implies the existence of a 
vector ® € H such that f(B) = (®, Bo), for all B € A, i.e., expression (6.3). 
Moreover, f is by assumption a state, so f(/) = ales = 

Relation (6.3) in turn implies the condition (a). Let {E.: a € J} bea family 
of mutually orthogonal projections in A and EF := Dgey Eq. For any k there is 
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an at most countable subset J, C J such that Eu¢g, = 0 for ae J\ J. The 
set K := Uf, Je is also at most countable; hence the corresponding subfamily 
of projections can be ordered into a sequence {E;} and )%,E; = E. Then 
f(E) aaa Dei (Pe, Ede) = QE a |E; Edel? , and moreover, f(E) < |E | =1, 
so the last series converges absolutely and we may rearrange it, 


f(E) = met (On Bde) = SHB = 2 fEa); 
ae 


j=lk=1 


z.e., the state f is normal. It remains for us to check that expression (6.3) is equiv- 
alent to condition (c). If the former is valid, we can define Ww := 72, (dbe, W) dx ; 
where the series converges with respect to the norm in H. The estimate 


< YS |e) dell < Hol DD del? 


k=m+1 k=m+1 


5 esti) oe 


k=m+1 


shows that operator W is defined on the whole H ; it is easy to check that W 
is positive and ||W|| < 1. Let {w~.} be a total orthonormal set in H. Each 
of the vectors ¢, has at most countably many nonzero Fourier coefficients; hence 
there is an at most countable subset in {w.} such that its complement belongs 
to KerW. We arrange it into a sequence {y,;} ; then TrW = O,(Y;,Wy;) = 
Dy Le |(Ge, s)|? = Xe |lell? = 1. This means that W is a statistical operator; in 
view of Theorem 3.6.8a the series expressing Tr(WB) converges absolutely and 


Lis, W By;) = XL Lhe Bu;) (3; bx) 
SS (Bb: 3) (V5, Ge) = >So (B* bk; be) = WB), 
7 ok P 


Tr (WB) 


i.e., assertion (c). On the other hand, if W is a statistical operator with the required 
properties and >>, wie(Yx,*)Vx is its spectral decomposition, we set dg = Wk Ve } 
in the case dim RanW < oo we append zeros to the obtained finite sequence. It 
is straightforward to check that the functional f is expressed in the form (6.3) by 
means of the vector ® constructed in this way. & 


The statistical operator W , which corresponds to the state f by the proved 
theorem, is in general not unique, and it need not belong to the algebra A. A 
simple illustration is provided by the algebra €'(H) of scalar operators on which 
there is just one state f : f(a) =a which is normal and pure. Any statistical 
operator from B(H) can in this case be taken for W , and moreover no such operator 
belongs to ((H) if dim#H = oo (see also Problem 17). We shall show that the 
nonuniqueness can be removed provided we restrict our attention to a particular 
class of W*-algebras. Assume that 


(aol) A is a direct sum of type J factors, 


(ao2) any minimal projection in A is one-dimensional; 


226 CHAPTER 6. OPERATOR SETS AND ALGEBRAS 


in view of Proposition 6.3.8 any algebra which obeys these requirements can be 
identified with 

A= @ B(Ha), (6.4) 

aéeJ 

where {H,} is some system of Hilbert spaces,  @.;Ha = H. Below we shall 
encounter algebras of this type as algebras of observables of quantum mechanical 
systems. Condition (ao2) is then a natural simplicity requirement; it is useless to 
associate an n-tuple of identical operators with a given observable. 


6.4.7 Theorem: Suppose that a W*-algebra A C B(H) satisfies the conditions 
(aol) and (ao2); then 


(a) there is a bijective correspondence between the normal states f on A and the 
statistical operators W € J%,(H) which are reduced by all the subspaces H, , 
such that 

f(B) = Tr(WB) (6.5) 


holds for all Be A, 


(b) a normal state f on A is pure iff the corresponding statistical operator W 
is a one-dimensional projection. 


Proof: Due to Theorem 1 a normal state f can be expressed in the form (6.3) 
and A can be identified with algebra (6.4). Let FE. denote the projections onto 
the subspaces 7{,. Each of the vectors ¢, has at most countably many nonzero 
components dka := Eade corresponding to a subset 4 C J, so the system { ka : 
aé Jp, k = 1,2,...} can be ordered into a sequence {¢¥)}. Since any operator 
Bé A commutes with all E., we get f(B) = Dy Lacey, (Pra: BOka) ; it follows 
from D1 Lacy, |lPeall? = SR |leel]? = 1 that the series converges absolutely and 
we can rearrange it, 


f(B) = z (¢%, Bd). 


Thus we have found for f a new expression of type (6.3), where now each of the 
vectors $7) belongs to some subspace H. We associate a statistical operator W 
with the sequence {¢)} in the same way as in the proof of Theorem 1; then 


WE = > (O, Bah)o 7 >= (Phas P V)bka = is LO. = E.Wy 


j=1 k=1 


holds for any ace J, WE H, ie., W commutes with all the projections E, and 
therefore it belongs to algebra A. 

Suppose now that statistical operators W,, W2 correspond to a given f, so 
that both of them satisfy relation (6.5). The operator W := W,—W, is Hermitean 
and of the trace class, and Tr(WB) = 0 holds for all B € A. In particular, 
TrW? = ||W||2 = 0, hence W =0. The map f + W is thus injective. It is also 
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surjective: for any statistical operator W the relation (6.5) defines a state on A, 
and mimicking the argument from the previous proof we can show that it is normal. 

To prove (b) we employ Theorem 6.2.7. The GNS representation yw corre- 
sponding to state (6.5) was constructed in Example 6.2.4b; we have seen that it 
is reducible if dim(KerW)+ > 1. In the opposite case W is a one-dimensional 
projection corresponding to a vector y € H. Operator W commutes with all FE, , 
so ~ must belong to the subspace Hg for some @ € J. Due to (6.4) the GNS 
representation my of Example 6.4.2a fulfils Hy = Hg and 7y(A) = B(Hg), which 
means that it is irreducible. § 


In conclusion, let us briefly mention the states which are not normal; for sim- 
plicity we shall consider only the algebra A = B(H). We have shown that a normal 
state on A corresponds to a unique statistical operator Wy which in turn defines 
the GNS representation my := tw, (cf. Example 6.2.4b); in this case any B € A is 
represented by the direct sum of its N identical copies, so ry is clearly faithful. By 
negation, f is not normal if the corresponding GNS representation is not faithful. 
It is easy to see that the kernel of 7, forms in that case a nontrivial ideal in B(H) 
which, in view of Proposition 6.1.2, is closed with respect to the operator norm. 
However, the following result is valid (see Problems 19-21). 


6.4.8 Theorem (Calkin): If is separable, then the only nontrivial ideal in B(H) , 
which is closed with respect to the operator norm, is the ideal K(H) of all compact 
operators in B(H). 


Hence f(B) =0 must hold for any B € K(H) if wy is not faithful. In view of 
the role played by the quantities f(A) in quantum theory, which will be discussed 
below, this suggests that non-normal states can have rather pathological properties. 


6.5 Commutative symmetric operator sets 


From now on we are going to consider more general operator sets including those 
which contain unbounded operators. With the needs of the following chapters in 
mind, however, we shall be interested primarily in the case when the unbounded 
operators are self-adjoint. 

An operator set is called symmetric if, together with each T € S, it also contains 
its adjoint; this notion obviously makes sense only if all the operators contained in S 
are densely defined. The symmetry is preserved at some set operations (Problem 22). 
Examples of symmetric operator sets are easily found. The set £,.(H) of all self- 
adjoint operators on a given H is symmetric. On the other hand, the set £,(H) 
of all symmetric operators is not symmetric because the adjoint to a symmetric 
operator need not be symmetric (cf. Example 4.2.5). If aset S of bounded operators 
is symmetric, then its linear envelope Sin is also symmetric; this is not true in the 
general case since the operators of Sjin, need not be densely defined. 

The notion of commutativity has been introduced in situations when either 
at least one of the two operators is bounded (Section 4.4) or both of them be- 
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long to a particular class, e.g., they are self-adjoint. Motivated by this, we define 
Lysa(H) = B(H) U Laa(H). A set S C Lysa(H) is called commutative if arbi- 
trary operators T,S € S commute mutually. Any subset of a commutative set is 
again commutative; in particular, an intersection of commutative sets is commu- 
tative. Operator sets S1, So € Lb,sa(H), in general noncommutative, are said to 
be commuting if any operator T; € S; commutes with all T,; € S2. The union 
of commuting commutative sets is a commutative set. Instead of studying a given 
S C Lysa(H) it is sometimes useful to treat some other set constructed from it. For 
instance, we can define 


Sp := {SNL a(H)} U {ReB, mB: BESNB(H)}, 
Sy, = {SNBA)} U {£4 : AMSLLFONS , MP Bere) , 
Sp = {SNB(H)} U { f(A) : A€ Lea(H)NS, f bounded Borel } ; 


the first set consists of self-adjoint operators only, while the other two are subsets 
in B(H). Some of their simple properties are listed in Problems 24, 25. 

The task of describing the structure of an operator set is considerably simplified 
if there are functional relations between its elements, which allow us to choose 
“independent” operators and to express the rest by means of them. 


6.5.1 Examples: (a) Consider operators A,B € B(€") such that A is Hermitean 
with a simple spectrum, o(A) = {Aj,...,An}, so the corresponding eigen- 
vectors form an orthonormal basis {e;}f_., in (". If the operators A,B 
commute, e; are at the same time eigenvectors of B ; mimicking the argu- 
ment of Example 5.2.3 we readily check that there is a polynomial Q of degree 
<n-—1 such that B= Q(A). 


(b) Let E,, E2, FE; be mutually orthogonal one-dimensional projections on C3. 
The operators A; := E,+ FE, and Ag := E,+E3 are Hermitean and commute 
mutually; however, neither of them is a function of the other. We can, of 
course, find a Hermitean A commuting with all E; such that A,, Ao are 
its functions: it is sufficient to put A := D3_,A;E;, where A; are mutually 
different real numbers. 


(c) If an operator set S is noncommutative or nonsymmetric, there may be no Her- 
mitean operator such that all elements of S would a its functions. Consider, 
e.g., the operator B € B(C*) represented by ( hac 6 ). The set {B} is not 


symmetric while {B, B*} is not commutative; the operator B is a function 
of no Hermitean operator since otherwise it would be normal. 


A Hermitean A € B(H) is called a generating operator of a set S C Losa(H) 
if for any T € S there is a Borel function fr such that T = f(A). It is clear 
that the generating operator is not unique: another Hermitean operator A such 
that A = h(A) for some Borel function h is also a generating operator of the set 
S and fr := froh. The above examples show that the generating operator may 
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not belong to S and it may not exist if S is not symmetric or commutative. A 
sufficient condition for existence of a generating operator is based on the following 
result. 


6.5.2 Theorem (von Neumann): Suppose that H is separable and A is a weakly 
closed commutative *-subalgebra in B(7) ; then there exists a Hermitean operator 
Aé€éA which generates A, i.e, A= A,({A}). 


6.5.3 Lemma: Let AC B(H) be a weakly closed algebra. A Hermitean operator 
A with the spectral decomposition {F;} belongs to A iff the algebra contains the 
projections ~; for all t <0 and J—£&; for t > 0. If this family of projections is 
denoted as {Pi)}, then A,({A}) = Av({Pi}). 


Proof: Let us begin with the sufficient condition. The spectrum of A is contained 
in some interval (—b,b). Given a positive integer n, we can write it as the union 
of the subintervals Jf := (b(k—1)/n, bk/n), k = —n+1,...,n. By Problem 5.3b, 
the corresponding spectral projections are E4(J}}) = Eok/n— Excnt)n. They belong 
to A for k < 0 due to the assumption, and furthermore, the identity E,4(Jf) = 
(I—Evx/n) — (I—Exe-1)/n) shows that the same is true for k > 0. Then the algebra 
also contains the operators Ay := Dfe_nyi @ Ea(JP), n = 1,2,..., so combining 
the weak closedness of A with Theorem 5.2.2d, we find Ae€ A. 

The functional—calculus rules also yield the necessary condition. By Proposi- 
tion 5.3.4 a sequence of polynomials {P,} can be found to a given s € (—b,0) such 
that limp—oo Pa(t) = x{-0,s(t). The same is true for the sequence of homogeneous 
polynomials P,(t) := P,(t)—Pa(0) ; we have P,(A) € A even if I ¢ A. Using 
the weak closedness together with Theorem 5.2.2d once again, we get E, € A for 
s <0; in the same way we can check that J—E, € A holds for s > 0. The last 
assertion follows easily from the proved equivalence. 


6.5.4 Lemma: Let H be separable; then we can select from any infinite set S C 
B(H) acountable subset So such that for any B € S there is a sequence {B,} C So 
fulfilling the conditions s-lim,.Bn = B and s-limp.B* = B’*. 


Proof: Since the set S can be expressed as the countable union Urj{B cS: 
n—1< ||B|| <n}, it is sufficient to assume that there isa K such that ||B|| < K 
holds for all B € S. We construct the Hilbert space H as the countable direct sum 
of identical copies of the space H (as in Remark 6.3.1); the latter is separable by 
assumption, so the same is true for 1. We choose a countable set {¢;} of nonzero 
vectors, which is dense in H and associate with each operator B € S the vector 


@p := {c1B¢1,c1B"b1,c2Boa,cB*ba,...} EH 


with c; := 271||¢,||"'. The subset ®s -= {@g: Be S} in the separable metric 
space H is separable itself (Problem 1.22), so we can choose a countable set Sp C S 
such that the corresponding ®s, is dense in ®s ; in other words, for any Be S 


230 CHAPTER 6. OPERATOR SETS AND ALGEBRAS 


there is a sequence {B,} C So such that 
co co 
IPa— Oa, 13, = 2 GIlBd;— Bnd? +o GIIB*S;— Bids? — 0 
j=l 9=1 


as n — 00; hence Bnd; > Bd; and Bro; — B*d; for all 7. Using the estimate 
|| B¢-Bndl| < ||B¢-B¢;|| + ||Bb;-Bnd;|| + ||Bnd;—Bnoll, we get the inequality 
lim sup,,_.0 ||B@—-Bnd|| < 2K \|¢—¢,|| for an arbitrary ¢ € H ; however, the set 
{;} is supposed to be dense in H, so the right side can be made smaller than any 
positive number. Thus it follows that s-lim,..B, = B ; the other relation can be 
proved in the same way. & 


6.5.5 Lemma: A weakly closed commutative *-subalgebra A C B(H) on a sep- 
arable H contains a sequence {E,} of mutually commuting projections which ge- 
nerates it, A= Aw({Ex}). 

Proof: Since A is a *-algebra by assumption, we have Ar C A C Ao(Ar), and 
therefore A = A,(A) = Aw(Ar). Using the notation of Lemma 3, we introduce 
See=— { Pi?) : A€ Ar, t € R}. This set is commutative by Problem 24 because 
Sp C (Ar). Moreover, Sp C Apr follows from Lemma 3 so Ay (Sp) C Aw(Ar) = 
A, and on the other hand by the same lemma, Sp C A,(Sp) yields Ag C Ay(Sp), 
i.e., together we get A= A,(Sp). Finally, Lemma 4 allows us to choose from Sp 
a countable subset {F,,} of mutually commuting projections. Clearly A,({Ex}) C 
A ; at the same time, for any E € Sp there is a subsequence {£x,} which converges 
to E strongly, and therefore also weakly. Hence Sp C ({Ex}),, C Au({Ex}), so 
Aw({Ex}) DAw(Sp)=A. 8 


Proof of Theorem 2: The just proved lemma tells us that algebra A is generated 
by a sequence {£,} of mutually commuting projections. To construct a spectral 
decomposition from it, we must replace {£,} by an ordered family of projections. 
We set F, := E,, and furthermore, Fo := E2F,, Fi+E2(I—F,). It follows from the 
commutativity of Fy, E2 that Fi, Fo, F3 are projections which commute mutually 
and belong to A; moreover, they satisfy the inequalities F, < F, < F;. In the 
next step we add the operator E3 and construct the commuting projections 


Fy = E3F ; Fs — Fo+E3(F\—F2) 5 Fe = F\+E3(F3—F}) , F; = F3+E3(I—-Fs3) ’ 


which belong to A and satisfy the inequalities Fy < Fy < F3< Fi << Fo < Fy < Fy, 
etc.;in the k-th step we add E, to the already constructed 2*-1—1 projections and 
“insert” the projection F” := F+E,(F’—F) between each neighboring F < F’ ; the 
left (right) neighbor of the minimal (respectively, maximal) projection is supposed 
to be zero (respectively, the unit operator). We have F < F” < F’; the operators 
FE, are obtained from the relation 


Pee J Fu | 


E,=. >) fF, See 


j=ok-l j=l 
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which shows that any homogeneous polynomial composed of the operators FE, can 
be expressed at the same time as a polynomial in the operators Fj, so the cor- 
responding weakly closed algebras coincide, Ay({Ex}) = Aw({F;}), in view of 
Example B.1.4. 

The sought spectral decomposition is now constructed as follows: we Pelee the 
interval [-1,0) into three equal parts and label the middle open interval (-3 at -1) 
as J,. Then we divide the remaining two intervals again into three parts and set 
Jo := (-§, -f) and Js := (2, -}) ; repeating this procedure we get a sequence 
{J;} of disjoint open intervals. We denote Mc := [—1,0) \ U2, Jj and associate 


with any t € IR \ Mc the projection 


Oia eet 
Fi = F; Ane te J; 
| Pe | 


The map t+ E; defined in this way on JR \ Mc is obviously continuous and non- 
decreasing; to get a spectral decomposition we have to extend it to the complement 
Mc. The latter is a Cantor—-type set, and therefore nowhere dense in JR. Hence for 
any t € RR there is a nondecreasing sequence {tn} C IR\ Mc such that t, ~t. The 
corresponding sequence of projections {E£;,} is also nondecreasing and converges 
strongly to a projection; we set FE; := s-lim,.../;, . The definition makes sense: if 
{t/,} is any other sequence converging to t , we have s-limn. My, = E;. Indeed, one 
can choose subsequences {t;,} and {t; } in such a way that fst) La a aac 
again nondecreasing, so the corresponding sequence of projections is nondecreasing 
and any subsequence of it converges to the same limit. It is now straightforward 
to check that the map t ++ F; is also right-continuous and nondecreasing at the 
points of the set Mc, i.e., that it is a spectral decomposition. 

The inclusion {Fj} Cc {E; : t < 0} in combination with Lemma 5 yields 
A= Au({Ex}) = Au({F;}) c A({ Ez: t < 0}) ; however, the set {EF : t < 0} 
belongs by construction to the weakly closed algebra A, so the opposite inclusion 
is also valid; together we get A= A({E,: t < 0}). We have I—E; = 0 for 
t > 0, and therefore {F,;: t <0} = {P\4)} | where A is the Hermitean operator 
corresponding to the spectral decomposition {E;,} (its spectrum is contained in the 
set Mc); it follows from Lemma 3 that A€A and A,({A}) = A,({Ph”}) = A 
In view of Theorem 6.3.4, this means that AC {A}", and Theorem 5.5.6 tells us 
that any element of the set {A}” c {A}%, can be expressed in the form y(A) for 
some Borel function y. & 


An analogous structural result is valid for more general operator families, in 
particular those containing unbounded self-adjoint operators. 


6.5.6 Corollary: Any commutative symmetric set S C Lb sa(H) on a separable 
H. has a generating operator A, which is contained in S”. 

Proof: If S contains unbounded operators, we construct to it the set S, C B(H) 
(cf. Problem 25). A function f7 is Borel iff the same is true for tan f7{-), and 
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moreover, S” = S”; hence it is sufficient to consider the case S C B(H). Due to 
the assumption, A,,(S) = Ao(S),, isan Abelian *-subalgebra in B(H) , so there is 
a Hermitean operator A € A,(S) which generates it, A,(S) C {A}”. This means 
that S c {A}", and since S” > A,(S) by Theorem 6.3.4, the operator A belongs 
to S’ 2) 


This result also extends to other operator sets, for which it still makes sense to 
speak of commutativity of unbounded operators (Problem 26). 


6.6 Complete sets of commuting operators 


Let S be a commutative subset in £..(#). The preceding section was devoted to 
internal properties of such sets; now we want to ask which conditions would ensure 
that S cannot be completed by “independent” commuting operators. Without loss 
of generality, the latter may be supposed to be bounded; then we have to require 
that the commutant S’ consists solely of functions of the operators from S, or 
in other words, S’ Cc S”. A set S with this property is called a complete set 
of commuting (self-adjoint) operators; for the sake of brevity we shall use the 
shorthand CSCO. 


6.6.1 Theorem: Let S be a commutative subset in £.,(H) where H is separable; 
then the following conditions are equivalent: 


(a) S is a CSCO, 
(b) the algebra S” is maximal Abelian, S’ = S” , 
(c) the set S has a cyclic vector, i.e., there is p € H such that S’p=H. 


Proof: In view of Theorem 6.5.6 and Problem 26b, there is a generating operator A 
of S such that S’ = {A}! and S” = {A}”. The implications (b) > (c) > (a) then 
follow from Theorem 5.8.6; it remains for us to check (a) > (b). By Problem 24, the 
set Sy is commutative and S’ = (S;)’. Hence we have Sy C (S;)’ which implies 
(S;)! D ~ t.e., S’ D> S"”. However, the set S is a CSCO by assumption, so 
S'= §". 


In view of Theorem 5.8.6, S is a CSCO provided it is generated by an —s 
A éS" having a simple spectrum. This is particularly illustrative if S = {A, ion 
finite and the operators A; have pure point spectra. We shall again assume that oes 
Hilbert space H is ropa the eigenvalues and the corresponding spectral projec- 
tions of A; will be denoted as \,’ and P,’’, respectively. Since S is gi to 
be commutative, all the eigenprojections commute mutually and Puy = a ee ) 
is a projection for any N-tuple {k} := {k,,...,kn} of positive integers; if i easy 
to check that the projections referring to diffonent N-tuples are mutually orthogo- 
nal. We are interested, of course, only in those {k} to which a nonzero projection 
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P(x} corresponds. We denote this set as Ky ; it is at most countable by assump- 
tion, so one can arrange it into a sequence, Ky = {{k}1,...,{k}n,...}, and set 
PY) ;= Pt} Fi 


6.6.2 Proposition: A set S = {A;}%, of commuting self-adjoint operators on 
a separable H with pure point spectra is a CSCO iff dim P = 1 holds for all 
{hb € Ky. 

Proof: To check the sufficient condition, we pick a unit vector ¢, from each subspace 
P™#H. In view of Problem 27, {¢,} is an orthonormal basis in H. We define 
W:= Dn 2-"bn ; the set M:= { Py: {k}, € Kn} is therefore total in H and 
its linear envelope is contained in S”w. It follows that H = Min C S"W CH, i.e., 
wy -is cyclic for S. 

Assume on the contrary that S is a CSCO with a cyclic vector w ; then for any 
nonzero ¢ € P'™H there is a sequence {B,} C S” such that Bny -+ ¢. Since 
S is commutative, Problem 27 yields ¢ = P™d¢ = limo B,P’™y ; the vector 
@ is nonzero so the same is true for P’”)¢. Suppose that the subspace P!(MH 
contains a vector y 1 P‘™2 ; the corresponding projection is denoted as E,. By 
assumption, x is a common eigenvector of the operators A;, so Ey, € S’ = 8S". 
Then By = BE,x = E,By holds for any Be S”, i.e., By = a(B)x. It follows 
that (By, x) = (By, P'™ x) = a(B*)(P™y, x) = 0 or x € {S"Y}+; however, o 
is a cyclic vector for S, and therefore y=0. &f 


Complete sets can have different cardinalities, but the most interesting are those 
which consist of a few operators only. The extremal case is a CSCO containing a 
single self-adjoint operator; in view of Theorem 5.8.6 this happens iff the opera- 
tor has a simple spectrum. The preceding proposition provides an illustration for 
operators with pure point spectra; recall some other important cases: 


6.6.3 Examples: (a) Each of the sets {Q} and {P} isa CSCO in L?(JR) — see 
Examples 5.8.2, 4.4.7, Theorem 5.5.6, and Problem 5.12. 


(b) More generally, due to Example 5.8.2, {Q,} isa CSCO on L?(R,du) for any 
Borel measure p» on RA. 


In other situations a “natural” CSCO consists of several operators. An example 
is given in Problem 28; in the same way we can prove the following more general 
result. 


6.6.4 Proposition: Suppose that self-adjoint operators A; on separable Hilbert 
spaces H,; have cyclic vectors ¢;, j = 1,2; then the set {A,,A,} is a CSCO on 
H, @ He and ¢) ® ¢ is its cyclic vector. 


Our aim is now to extend this result to the situation, where the CSCO in each 
of the spaces ; need not consist of a single operator. First, for symmetric sets 
S; C B(H;), j = 1,2, we introduce the following notation: S; := {B;: B; € S;} 
and Sy := S, US»; alternatively we shall write S;@Jo:=S, and 1; @S2 := So. 
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6.6.5 Theorem: The W*-algebras generated by symmetric sets S; C B(H;), j= 
1,2, and by the set Sp C B(H; © He) are related by 


Aw(S;US;) = Aw(Si) @ Aw(S2). 


Proof: As the first step, we shall derive the identities 
(S))"= SY@h, (S2)"=h@sy. (6.6) 


Let {ta} be an orthonormal basis in Hz, and denote by E‘*) the corresponding 
one-dimensional projections. We define the unitary operators Va : Hi — Hi ® 
BO, = Ee H by Vad := 6@ Ya, and extend the inverse operators to the whole 
H := H, ® He setting Vix := Vig x. It is easy to check that V, and Vit 
as well as VaVg for any a,( are partial isometries (cf. Problem 30). We shall 
show that an operator B € B(H), which commutes with all V,Vj" , is of the form 
B, @ Ig for some B,; € B(H,). Due to the assumption, there is -y such that 


Vit BV = ViV, Vt BVy = VABV,VjtVa = baaV +BY, ; 


putting a = G we get V+ BV, = V+ BV, := B, forall a. Since V,, Vi are partial 
isometries, B, belongs to B(Hi). Furthermore, the relation V3'BV, = 6baBi 
implies 


B($@va) = SY VaV Fi B(OOVa) = Yo VebasBid = BidSya = (Bi @lz)(O9ex) ; 
8 B 


however, the two operators are linear and bounded, so B = B; ® I, follows from 
their coincidence on the total set Hx {wa}. 

To prove the relations (6.6) we define the set V(S,) := { B € B(H): V7 BV, € 
S; U {0} for all a, 8}. Using Problem 30, we find that 


CB > VaVF(C@l) Ve BVYIVS = 5 ye(veBVA)V+ 
By 


«8,7 


holds for any B € B(H) and C € B(H;). In the same way we can express the 
product BC; ; it shows that the operators B, C, commute iff the same is true for C 
and V; BV, with any a,@. Choosing B € V(S,),C € Si, and Be (S,),C ES), 
we get from here respectively the inclusions 


Si @InCV(S\)' and (S,) C VIS). 


Suppose for a while that /,; € S, ; then the operators V,V3;* belong to V(S;) for all 
a, and each element of the conmutant V(S,)! is of the form C @ I,. However, 
such an operator commutes with all Be V(S\) iff [C, Vj BV] = 0 holds for all 
a,B, we, if Ce S,. This fact, together with the first of the above inclusions, 
yields the identity S| @ Iz = V(S,)! which is valid under the condition I, € S). 
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Hence we may apply it to the commutant S{, which contains the unit operator, 
obtaining V(S})! = S{ @ Iz D S,. This in turn implies V(S{) C V(S{)”" Cc (S,)! 
which, in combination with the second of the above inclusions, gives V(S{) = (S;)’. 
Then the commutants of these sets also coincide, so we finally arrive at the first. of 
the relations (6.6), (S,)” = V(S{)! = S! @ Iz ; the other identity is checked in the 
same way. 

The rest of the proof is easy. In view of (6.6), any operators B; € S/ = 
Aw(S;), 7 = 1,2, belong to (§;)”, so they commute with any C é (Sp)! =(S yn 
(S»)’ and [B, ® B,, Ow [B,B,,C] = 0. Hence By ® Be € (Ss)”, which implies 
the inclusion Aw(Si) ® Aw(S2) C Aw(Sr). At the same time, Ss = S; US, C 
{ B, ® By: B; € Aw(S;)} ; combining these inclusions with the definition of the 
W*-algebra tensor product, we conclude the proof. §& 


The proved theorem provides the sought extension of Proposition 4. 


6.6.6 Corollary: Let S;, 7 = 1,2, be a CSCO on a separable Hilbert space 4, ; 
then the set S:= {A,: A; € S;}U{A,: Ap € Sp} is a CSCO on H, @ Hp. If 
o; is a cyclic vector of the set S;,7= 1,2, then $1 ® @2 is cyclic for S. 

Proof: Assume first that S; C B(H;) ; then the operators A; are closed and 
S = Ss. Asa CSCO, each of the sets S; has a cyclic vector which we denote as ¢;. 
Since { B;p; : B; € Sj} is total in H;, the set { Bibi ®@ Bodo: B; € Si} is total 
in H := H,;@H2. Moreover, {B,@ Bz: By € SZ }in C { Bi @ Be: B, = Si}! = St 
where the last relation follows from the arent so 


H = { Big, ® Boda: B; € Sf hin C S"(b1 @ be) CH. 


If the sets S; contain unbounded operators, we construct to them the sets Sj_ C 
B(H;) as in Problem 25. We have S/ = Sj, ; hence Sj; is a CSCO with the same 
cyclic vector as ¢;, and by the alweadly proven result, S;,US2_ is a CSCO with 
the cyclic vector ¢; © ¢2. Finally, we use Problem 5.28c due to which arctan A; = 
arctan A;, 7.€., Si, USoqg = Sa, and since Sj =S" the proof is completed. & 


6.7 Irreducibility. Functions of noncommuting 
operators 


In Section 4.4 we have introduced the notions of invariant subspace, irreducibility, 
etc., for a given operator; now we are going to do the same for operator sets. We 
say that a subspace L C H is an invariant subspace of an operator set S if it is 
an invariant subspace of any operator T € S. A closed subspace G C H reduces 
the set S if it reduces each operator of this set; we shall again say alternatively 
that S is reduced by the corresponding projection Es. An operator set S is 
called irreducible if there is no nontrivial closed subspace which reduces it; in the 
opposite case it is called reducible. 

Irreducibility is equivalent to nonexistence of a nontrivial projection, which 
would commute with all the operators of S. In the case when S is symmetric 
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we have a simpler criterion analogous to Problem 4.24a (see Problem 33a). If all 
the involved operators are bounded we need not worry about their domains: a sym- 
metric set S C B(H) is reducible iff it has a nontrivial closed invariant subspace. 
It is also obvious that for a pair of operator sets S; C S2 reducibility of S2 implies 
reducibility of S,, and conversely, irreducibility of S2 follows from irreducibility of 
the set S;. 

If a closed subspace G C H reduces an operator set S, we can construct the 
sets Sg := {T)G: T € S} and Sg. ; any operator T € S is the orthogonal 
sum of its parts in the subspaces G, G+. The sets Sg and Sgi have the same 
cardinality as S ; however, their structure is in general less complicated since they 
act on “smaller” spaces. If at least one of them is reducible we can continue and 
take the next step; however, it may happen that we never end up with irreducible 
sets (even if S consists of a single operator — see Example 2b below). A good 
illustration of the reduction process is provided by the classification of W*-algebras 
described in Section 6.3. 

It is not always convenient to check irreducibility of a given operator set directly 
from the definition, so we want to find an equivalent expression to be used instead. 
Two simple criteria can serve this purpose. 


6.7.1 Theorem: Let S C Losa(H), dimH > 1, be a symmetric set; then the 
following conditions are equivalent: 


(a) S is irreducible, 
(b) the commutant of S consists of scalar operators only, S’= C(H), 
(c) any nonzero vector ¢ € H is cyclic for S”. 


Proof: We shall check the chain of implications (a) > (b) > (c) = (a). Suppose 
that S is irreducible and C € S’. If C is a projection it must equal either 0 
or 1. If C is a Hermitean operator with the spectral decomposition {E,}, then 
Theerem 5.3.1 implies {C'}’ = {B)}’ so {By} c {By = {CP c SS’. Sime 
the spectral decomposition is monotonic and cannot contain nontrivial projections, 
there is a Ao € IR such that BE, = 0 for A < XA end Ey, =_7 for ® > A, 
t.e., C = Aol. Finally, let C be any bounded operator from S’. The set S is 
symmetric by assumption, so S, is also symmetric and the same is true for S’ = e,. 
The Hermitean operators Re B and ImB belong to S’; hence they are scalar and 
C is also scalar; in other words, we get (b). 

The relation S‘ = (H) implies S” = B(H), so S"6 =H holds for any nonzero 
vector @ € H, and the latter is therefore cyclic. Finally, suppose that the set S is 
not irreducible; then there is a nontrivial projection E € S’ = (S”)’. Any nonzero 
vector @ € EX then satisfies S’¢ C EH, which means that it cannot be cyclic for 
the set S”. §f 


6.7.2 Examples: (a) A symmetric commutative set S C Ly sa(H) is always re- 
ducible provided dim# > 1. It is sufficient to assume that S contains at 
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least one nonscalar operator T since otherwise we would have S Cc €(H) and 
S’ = B(H). If T is bounded it belongs to S’. If it is unbounded self-adjoint, 
S' contains its spectral decomposition {E,} and as T is not scalar by as- 
sumption, we can find at least one nontrivial projection in S’. In all cases 
therefore S’ # ((H), so the claim is proved. 


(b) By the previous example, a set {A} consisting of a single self-adjoint operator 
is reducible unless dim# = 1, and the same is true for {A} F4(M)H } 
where M is any Borel set in IR. Hence the complete decomposition of {A} 
to irreducible (one-point) sets is possible only if A possesses a total set of 
eigenvectors, 7.e., a pure point spectrum. 


{c) Consider the right-shift operator S on an infinite-dimensional separable 
Hilbert space # with an orthonormal basis {¢;}%2, introduced in Exam- 
ple 3.1.3. We want to prove that the pair {S,5*} is an irreducible family. 
All operators C € {S,S*}' satisfy S*C¢, = CS*d, = 0, and therefore 
Cd, = ad, because KerS* = {¢)}yin. Then any basis element satisfies 
Co; = CS3-'¢, = S1-'C¢, = ad; , and since C € S’ is bounded by defini- 
tion, we get C=al. 


(d) It follows trivially from the previous example that the set B(H) is also irre- 
ducible. This is true, of course, for any dimension of : the operators Pyg of 
Problem 3.6 are bounded for any ¢,7% € H and therefore any nonzero ¢ € H 
is cyclic for B(H). 


(e) We shall prove finally that the operators {Q,P} form an irreducible set on 
L?(UR). We know from Example 6.6.3a that {Q} is a CSCO, which means 
that for any operator B € {Q,P}’ there is a bounded Borel function fz 
such that B = fg(Q). Furthermore, BP Cc PB holds by assumption, so 
the function By = fp(-)v(-) belongs to D(P) for all ~ € D(P). To any 
bounded interval J C IR we can choose ~; € D(P) such that w~y(xr) = 1 
for x € J; it follows that fg is absolutely continuous in RR. The inclusion 
BP Cc PB implies, in addition, 

—ifp(z)y'(x) = (BPy)(x) = (PBY)(z) = —i(fa(z)y(2))’, 
i.e. fp(x)v(x) = 0 for all x € D(P) anda.a. x € R. In view of the absolute 


continuity of the function fg, we end up with fp(x) =c for all z € R, which 
means that B is scalar. 


Now we want to find a way to construct an irreducible set on the tensor product 
H, © Hz from irreducible sets S; on the “component” Hilbert spaces H;, j = 1,2. 
It appears that we can use the same construction for that purpose as in the case of 
commutative sets discussed in the preceding section. 


6.7.3 Theorem: Let S; C L;,sa(H;) be symmetric irreducible operator sets on 
Hilbert spaces H;, 7 = 1,2; then S:= {A,: A; € S;}U{ AQ: Ao € So} is an 
irreducible set on #11 ® He. 
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Proof: Due to the assumption, S} = C(H;) so Si = B(H;). First, let Sy C BGG); 
then S = Sy and Theorem 6.6.5 implies Aw(Ss) = (Sz)" = S/ ® Sy = B(Hy) @ 
B(H2), and therefore (Ss)! = (B(H1) ® B(H2))’ = C(Hi @ He) (see the notes to 
Section 6.6). Moreover, it is obvious that Sy is symmetric; hence it is irreducible by 
Schur’s lemma. If some of the sets S; contain unbounded operators we use the sets 
Sj as in the proof of Corollary 6.6.6: we have S” = St = (S1,a@S2,0)" = St,g@S20 5 
and since S/, = Si = B(H;), the set S is again irreducible. [J 


6.7.4 Remark: It is not difficult to see that the proved assertion is nontrivial: 
other sets consisting of simple combinations of the operators from S,, Sz need not 
be irreducible. As an example, consider the sets S; = Sp = {7(L;): 7 =1,2,3} C 
B(C?) with m(L;) := 30;, where 0; are the Pauli matrices, and define 


m(L;) := m(L;) @1+1@x(L;) 


and m(L?) := D3_,7(L;)? on the tensor product C? @€? = C*. The last 
named operator is nonscalar (we leave the proofs to the reader — Problem 34), 
and commutes with all the m5(L;), so the set Sig := {ay(L;) : 7 = 1.2.3} is 
reducible. On the other hand, as(L*) does not commute with the elements of 
Sy := {x(L;) @1, 1 @x(L;) : 7 = 1,2,3}, as should be the case, because each 
m(L;) has a simple spectrum, so the sets S; are irreducible and by Theorem 3 the 
same is true for Sp. 


There is also another viewpoint from which irreducibility can be discussed. We 
know that if S = {A,,..., An} isa finite family of commuting self-adjoint operators, 
its bicommutant consists exclusively of functions of the operators A,,...,An. The 
situation is considerably more complicated if the operators do not commute. The 
only naturally defined functions are then polynomials, and even in that case caution 
is needed because such a polynomial need not be densely defined if some of the 
operators A; are unbounded. 

This difficulty can, however, be overcome by replacing the A; by their bounded 
functions and taking polynomials of these operators, i.e., the algebra Ao(S,) for the 
set S; defined in Section 6.5; recall that the unit operator as a bounded function 
of any A; is contained in Ao(S;). Now by Theorem 6.3.4 and Problem 24, this set 
generates the W*-algebra (S;)” = S”, which means that any operator of S” can 
be approximated by the polynomials from Ao(S;) in the weak operator topology 
(and also in the strong, o-weak, and o-strong topologies; it should be remembered 
that none of them is first countable, so the approximation can be realized by a net, 
but in general not by a sequence). 

In this sense therefore the elements of S” can be regarded as functionally de- 
pendent on the operators A,,...,A,. The importance of irreducible sets now stems 
from the fact that if S is irreducible, then this conclusion extends, due to Theo- 
rem 1, to any bounded operator on H. For instance, Example 2e tells us that 
every B € B(L?(JR)) can be approximated by polynomials in bounded functions of 
operators Q and P. 
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On the other hand, there is no general way to introduce a functional calculus for 
noncommuting operators. To illustrate the difficulties which would arise, consider a 
pair of noncommuting Hermitean operators A,, A2. The function F’: f(x,y) = xy 
is a polynomial, so it should be associated with an analogous operator polynomial; 
however, both A;A2 and A2A, are candidates for the role of f(A,, A2) as is more 
generally ~A,;A2+(l—a)A2A; for any a € R. The function f is real-valued so the 
corresponding operator should be Hermitean; this requires f(Ai, Ao) = (Ai Ag+ 
A2A;) but such a map is not multiplicative, because 


(fP)(Ar, Aa) = S(ATAR+ ABAD) # Z(t AdAL)? = (f(A1, A)? 


Hence the considerations of Sections 5.2 and 5.5 cannot be extended to the case of 
noncommuting operators; if the symbol f(A1,...,A2) is nevertheless used in such 
a situation, it is always a shorthand for a particular operator. 


6.8 Algebras of unbounded operators 


We have seen how useful algebraic methods are for the analysis of operator sets. 
It is natural to ask whether some of these notions and results can be extended to 
the case of unbounded operators. Since the problem is, of course, technically more 
complicated we limit ourselves to a few remarks; in the notes we give a guide for 
further reading. 

If a set S C L(H) has to be equipped with an algebraic structure we must 
ensure that the algebraic operations make sense for its elements. A way to do that 
is to assume existence of a common dense invariant subspace. Let D be a dense 
subspace in a Hilbert space H ; then the symbol L+(D) will denote the set of all 
operators in £(7#) such that 


(i) D(T) =D and D is invariant with respect to T, TDC D, 
(ii) the adjoint T* exists, D(T*) > D, and T*DC D. 


The set £*(D) has the structure of a *-algebra: the sums and products of the 
operators from L+(D) belong to £*+(D), and the involution can be defined by 
T + Tt, where T+ := T*)D. Any subalgebra A C L*(D) which contains 
the unit operator is called an Op*-algebra. Algebra A is called closed if D = 
Ares D(T) and self-adjoint if D = Qre4D(T*). We also say that A is standard 
if any symmetric operator A € A is e.s.a. 


6.8.1 Example: Consider again the operators Q,P on H = L?(JR). By Exam- 
ples 1.5.6 and 4.4.7, the subspace S := S(UJR) is dense in H and invariant with 
respect to Q,P; hence Qs :=Q)S and Ps := P}S belong to £t(SUR)). Now 
consider the algebra A consisting of all nonhomogeneous polynomials in Qs, Ps ; 


using the relation 
QPp— PQY = —i (6.7) 
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we can rewrite it as A= {T = D?,-1 ajeQ? P* : aj € C, n positive integer } . It 
follows from Proposition 4.1.2 that T+ = DF ,a1 aj~rP*Q? . Using the commutation 
relation (6.7) we can express this operator as T+ = D7, &jnQ? P* , which means 
that A is an Op*-algebra; it can be proven that it is vekonenl and self-adjoint (see 


the notes). 


The algebra A defines a topological strucure on D. It is easy to check that the 
family of seminorms { pr = ||T-|| : T € A} separates points, and the corresponding 
topology 74 is the weakest locally convex topology in which any operator T € A 
is continuous as a map (D,T,4) ++ H. Since I € A by assumption, T,4 is stronger 
than the topology induced by the Hilbert-space norm of H in D. Further, let M 
be the family of all bounded sets in (D,74) (see the notes to Sec.1.4); then we 
associate with any M € M the seminorm sy: sm(T) =supgyem \(¢,TY)|. The 
system {sy: M € M} separates points; hence it defines on A a locally convex 
topology Tp which is called uniform (see Problem 35 and the notes). 


6.8.2 Theorem: An Op*-algebra AC Lt(D) with the topology rp is a topolo- 
gical +—algebra. 


The algebra (A,7p) is called an O*-algebra. Some results which are valid in 
algebras of bounded operators can be extended to Op*-algebras and O*~-algebras. 
We mention several of them in the notes; here we limit ourselves to a few words 
about states on Op*-algebras. 

By Proposition 6.2.2e, any positive functional on a C*~algebra is continuous. 
This suggests how to formulate the definition in the case of unbounded operators: 
a state on an Op*-algebra A Cc Lt(D) is a positive functional F : A -— € 
such that f(/) = 1 ; the positivity here means f(T*+T) > 0 for all Te A. A 
linear functional g : A — ( is said to be strongly positive if g(T) > 0 holds 
on the set Ay := {T € A: (¢,T¢) > 0, 6 € D}. Im the case of bounded 
operators the two notions coincide in view of Theorem 6.2.1; generally one has only 
P(A) :={TtT : T € A} C A,, so any strongly positive functional is positive but 
the converse is not true. 

We want to know whether a state on an Op*-algebra A Cc Lt(D) can again 
be expressed by means of a statistical operator, as in Theorems 6.4.1 and 6.4.7. We 
shall mention one assertion; for its proof and related results we refer to literature 
quoted in the notes. We denote 


N(A) = {We B(H): WT, WT € 9,(A), TE A, andWHC D, WH CD} 


and Ji(A)4 := {W € Ji(A): W > 0}. We have I € A, so J(A) is a subset of 
J(H) and Ji(A), C T(H)4 ; we also denote 7, := Te+ py. 


6.8.3 Theorem: Let (D,7,) be a Fréchet space; then the following conditions are 
equivalent: 


(a) any bounded set in (D,71) is precompact, 
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(b) if f is a strongly positive functional on A and T,4 = 7,, then there is an 
operator W € J,(L*(D)), such that f(T) = Tr(WT) = Tr(TW) holds for 
all Te A, 


(c) any strongly positive functional f on £Lt(D) is of the form f(B) = Tr (FW) 
for some W € (,(Lt(D)),. 


Notes to Chapter 6 


Section 6.1 The notion of C*—algebra was introduced by I.M. Gel’fand and M.A. Naimark 
in 1943. Some mathematicians use the term B*-algebra in the same sense (see [ Ru 2], 
Chap.11, [ Ti]); a normed *—algebra fulfilling condition (6.1) is called a completely reg- 
ular algebra — cf. [Nai 1], Secs.16 and 24. The original definition also included the 
requirement that e+a*a must be invertible for all a € A; only later it became clear that 
this is fulfilled automatically — see Theorem 6.2.1 and [BR 1], notes to Chap.2. Let A 
be the extension obtained by addition of the unit element to a C*—algebra A ; then there 
is a unique extension of the norm of A, which makes A a C*-algebra. This extension 
is given by ||[a,a]|| g := supyey<: ||ab+ab|| (cf Problems 1 and 3). It is slightly more 
complicated to check that such a norm has the C*—property — see, e.g., [Di 2], Sec.1.3.8. 

Theorem 5 is the main result of Gel’fand’s theory of commutative Banach algebras, 
which is discussed, e.g., in [BR 1], Sec.2.3.5; [Mau], Chap.VIII; [Ru 2], Chap.11; [Sa], 
Sec.1.2.; [Si 1], Sec.4. The space A consists of all multiplicative functionals on A, i.e., 
such that f(a)f(b) = f(ab), and the isomorphism A — C(A) is given by the so-called 
Gel’fand transformation, a@(f) := f(a) for all f € A. The application of Gel’fand’s 
theory to proof of the spectral theorem is discussed in [Mau], Sec.IX.2; [Nai 1], Sec.17.4; 
[Ru 2], Chap.12. 

If we define a C*—algebra by factorization with respect to an ideal, the only nontrivial 
thing is to check that the factor norm introduced in Appendix B.2 has the property (6.1) 
— cf. [BR 1], Prop.2.2.19 or [ Di 2], Sec.1.8.2. Instead of “direct sum”, the term C*- 
product of the algebras Ag, a € I, is sometimes used — see [Di 2], Sec.1.3.3. The direct 
sum of C*-algebras does not coincide with the direct sum of the Banach spaces Ag — 
cf. Section 1.5. The definition of the tensor product is adopted from [ Di 2], Sec.2.12.15; 
[BR 1], Sec.2.7.2. In general, there are more candidates for the role of the tensor product 
(see the notes to Sec.2.4); a discussion can be found in [Lan 1]. 


Section 6.2 The definition of a cone appears in the literature in various modifications — 
see, e.g., [ Nai 1], Sec.3.10; [RS 1], Sec.IV.4. The proof of Theorem 1 uses the functional 
calculus on C*-algebras; it can be found in [BR 1], Sec.2.2.2; [ Di 2], Sec.1.6; [Ru 2], 
Sec.11.28; [Sa], Sec.1.4. Proposition 2 remains valid in Banach *-algebras with the unit 
element but the proof is more complicated — cf. [Ru 2], Sec.11.31. Other properties of 
positive functionals are discussed in [ BR 1], Sec.2.3.2. 

The GNS construction was formulated in the papers [GN 1], [Seg 1]. Due to 
Theorem 3 all GNS representations are unitarily equivalent, so it is possible to speak 
about properties of such a representation without a more detailed specification. The 
sketched proof of Proposition 5 is given with all details in [ Ru 2], Thm.12.39; see also 
[BR 1], Lemma 2.3.2; [Di 2], Thm.2.6.1. 
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A pure state f on A can be defined equivalently as such that the inequality f 29, 
where g is a positive functional, implies g = Af, A € {0,1] — see [ BR 1], Sec.2.3.2. 
Nonextremal points of S4 are called mized states. Theorem 7 was proved for the first 
time in {Seg 1]; see also [ BR 1], Thm.2.3.19; [Di 2], Sec.2.5 or [Si 1], Sec.5. For 
a necessary and sufficient condition under which a pure state can be extended from a 
subalgebra BC A to the algebra A — cf. [And 1]. 


Section 6.8 The topologies introduced in Remark 1 are sometimes also called ultrastrong 
and ultraweak; however, these notions may be misleading because Toy is stronger than 
Ty. The fact that all the five topologies are mutually different in an infinite-dimensional 
Hilbert space is proven in [ Di 2], Sec.I.3. This still does not exhaust all nonequivalent 
locally convex topologies on B(H) ; another pair of them can be found in [BR 1], Sec.2.4.1. 

The foundations of the theory of W*-algebras were formulated in a series of papers 
by J. von Neumann starting from 1929. There are numerous monographs devoted to this 
subject, e.g., [BR 1], Chap.2; [Di 1] and Appendix A to [Di 2]; [Em], Chap.2; [Nai 1], 
Chap.VII; [Sa] , etc., where additional information can be found. Corollary 5 provides 
us with an alternative way to define a W*-algebra; in that case the condition J € A is 
sometimes dropped — see [ Nai 1], Sec.34.1. Another easy consequence of Theorem 4 is 
the so-called bicommutant theorem: let A be a nondegenerate *-subalgebra in B(H) ; 
then the following conditions are equivalent: 


(a) A=A’", 
(b) A is closed in any of the topologies Ty, Ts, Tow, Tos 
(c) A is closed in all of the topologies Ty, Ts, Taw) Tos - 


This result can be extended to some other topologies on B(#1). Moreover, the conditions 
(a)-(c) are equivalent to closedness of the unit ball, A; := {B eA: ||Bl| < 1}; see 
{BR 1], Thm.2.4.11; [Di 1], Sec.1.4. Still another consequence of Theorem 4 is the 
von Neumann density theorem (Problem 9); a stronger result is the Kaplansky density 
theorem — cf. [BR 1], Thm.2.4.16; [ Di 1], Sec.1.5. 

There is also an abstract way to define W*-=algebras as a particular class among 
all C*-algebras with the unit element. Given a W*-algebra A C B(H) we define its 
predual A, as the set of all o-weakly continuous linear functionals on A; since any such 
functional is norm continuous we have A, Cc A*. It can be shown that .A, is a Banach 
space with respect to ||- ||4* and A is isometrically *-isomorphic to the dual space of 
A, by vy: (y(a))(f) = f(a) — see [BR 1], Prop.2.4.18; [Si 1], Thm.6.4. We usually say 
briefly that A is dual to A,; hence the name. The converse to this result is the so-called 
Sakai theorem: A C*-algebra with the unit element can be represented isometrically by 
means of some W*~algebra on a Hilbert space H iff it can be identified with a dual of 
some Banach space (cf. [Sa], Thm.1.16.17). This can be used to formulate the mentioned 
abstract definition. 

The described classification of W*-algebras was formulated in a series of papers by 
F.J. Murray and J. von Neumann, published in 1936-43. The proofs of Proposition 8 to 
Theorem 14 can be found, e.g., in [ Di 1], Secs.I.6-9, III.1-3; | Nai 1], Secs.36-38, or 
[Sa], Chap.2 together with many details and related results. The type of a given factor 
can be determined equivalently using the relative dimension d4, which is defined as the 
restriction of a normal faithful trace on A to the set A*® of all projections in .A ; this 
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makes sense because any two normal faithful semifinite traces on A are proportional to 
each other. The function d, has the following properties: (i) d4(F)=0 iff E=0, 
(ii) d4( £1) = d4(E2) if E,, EB, are equivalent with respect to A, (iii) d4(E1) < d4(F2) 
if Ey < Fe, (iv) d,(E}+E2) = d4(E£\)+d,4(E2) if EE» ad 0, and finally, (v) d,(E) < oo 
for all E € A” iff the reduced algebra Ag is finite. The range of the relative dimension 
determines the type of a factor; for different types, with a suitable normalization, the set 
d,(A®) equals 


AMD io ey Thy ee dy 

HO ees OO}, ose dns 
[0, 1] aoe 8S 
[0, co] se dee 
{0, oo} ae, 


Type I algebras usually appear in quantum mechanical applications (see Sections 7.4 and 
7.6 below) while in quantum field theory and statistical physics we often meet algebras of 
type III — see, e.g., [Ara 1] or [BR 1], Sec.5.2 and the notes to it. This has stimulated 
a search for a finer classification of these algebras; it appears that there are infinitely many 
nonisomorphic factors of type III, the so-called type III, for A € [0,1]. More details 
are given in {Con 1] or [ BR 1], Sec.2.7.3, where some results concerning a classification 
of C*-algebras are also mentioned. Numerous examples of W*—algebras of types IJ and 
ITT are given in [Sa], Chap.4. 


Section 6.4 The direct sum }°y¢; Eo in the definition of a normal state belongs to A 
due to [Di 1], Appendix II. A state f on A is normal iff f(supgcy Aa) = SUDgey f (Aa) 
holds for any nondecreasing net {Ag} C Ay — see, e.g., [Kad 1] or [BR 1], Sec.2.3.4; 
[Di 1], Sec.I.4. This property is often used as a definition of normal states, while the states 
which satisfy relation (6.2) are called completely additive. Another equivalent formulation 
is the following ([BR 1], Thm.2.7.11): f is normal iff f(A) = D§21 f(A;) holds for any 
sequence {A;} C A; such that A := w-limnsoo D214; € Ay. This is not surprising 
since the right side of relation (6.2) is finite only if there is at most countable number of 
nonzero values f(E.). 

The presented proof of Theorem 1 is adapted from | Di 1]; a somewhat different proof 
can be found in [BR 1], Thm.2.4.21. The result easily extends to o—weakly continuous 
functionals on A, which are neither positive nor normalized (Problem 17). Notice also that 
any state is normal if dim < oo ; then only assertion (c) provides nontrivial information. 
The proof of Lemma 3 leans on the following nontrivial result: a linear functional f ona 
weakly closed subspace M € B(H) is o-weakly (a-strongly) continuous iff its restriction 
f Mj, to the unit ball is o—-weakly (respectively, o-strongly) continuous — cf. [Di 1], 
Sec.1.3 for more details. 

The GNS representation corresponding to a normal state need not in general be faithful 
as Example 6.2.4c illustrates. Theorem 8 comes from the paper [Ca 1], where an example 
can be also found of a faithful representation of the factor algebra B(H)/K(H), which 
proves the existence of noninjective representations of B(71). 


Section 6.5 The term “symmetric set” is often replaced by *-invariant set, which is 
more appropriate in a sense when the set contains unbounded operators which are not 
closed. The main result of this section is Theorem 2, which is usually called the theorem 
on a generating operator; it was proved first by J. von Neumann — see [Di 1], Sec.1.7; 
[RN], Sec.IX.1 or [AG], Secs.90, 92. In the case of a separable H, Theorem 5.5.6 allows 
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us to define the generating operator equivalently by the requirement S C {A}f,. 


Section 6.6 Given a subset S of B(H) we call 6 € H a cyclic vector if Ao(S)@ =H. 
In particular, if S is a *-algebra containing the unit operator, then ¢ is cyclic for Ss 
iff it is cyclic for S” (Problem 32). The generating vector of a Hermitean operator A 
defined in Sec.5.8 is at the same time a cyclic vector of algebra {A}” — cf. Problem 5.43. 

Problem 28 has an interesting consequence. If A € S” is a generating operator of 
the set S = {Q1,Q2}, then there are real-valued Borel functions g and f; such that 
A=(Q1,Q2) and Q; = f;(A), 7 = 1,2. Changing the functions fjog on a (41 @ H2)- 
zero set if necessary we can achieve that the relations f;(g(x1,%2)) = 2; hold for all 
[x1, 2] from any fixed bounded interval J C IR? ; in this way we get a bijection between 
J and a certain subset of IR. A similar map, the so-called Peano curve, is used in set 
theory to prove that IR? and JR have the same cardinality — see, e.g., [vN], Sec.II.10, 
or [ Al], Sec.5.2. 

If B € B(H;@H2) commutes with all operators of the form 1; @C with C € B(H2), 
it commutes particularly with Vo Vi defined in the proof of Theorem 6.6.5, and therefore 
B = B,@1Ib for some B, € B(M), i.e., we get the relation (€ (Hi) ® B(H2))’ = 
B(H1) ® C(H2). In a similar way, we find (B(H,) @ B(H2))’ = C(H1) @ C(H2). These 
two identities represent particular cases of the relation (A; @ A)’ = Ai ® Aj, which is 
valid for any semi-infinite W*~algebras A, Ag — see [Di 1], Sec.1.6. 


Section 6.7 The definition of irreducibility of operator sets corresponds to the one— 
operator case discussed in Sec.4.4. It is worth mentioning that this terminology does not 
fully correspond to that used in the representation theory of groups and algebras (cf. 
Appendix B.1). In view of Problem 33a, a representation is irreducible iff its image is 
an irreducible operator set, provided the latter is symmetric; this is true, for instance, if 
we deal with representations of *—algebras or unitary representations of groups. In the 
general case those representations whose image is a reducible set are called completely 
reducible — see, e.g., [Zel]}, Sec.16. 

There is another viewpoint from which representations can be classified. A represen- 
tation a of an algebra A by means of linear operators on some vector space V is called 
algebraically irreducible if (A) has no nontrivial invariant subspace. If V is a topological 
vector space and 7(A) has no nontrivial closed subspace, 7 is said to be topologically 
irreducible — cf. | Di 2], Sec.2.3; [ Kir], Sec.7.1; [Nai 3], Sec.III.4. It is clear that the 
algebraic irreducibility of a given representation implies its topological irreducibility, while 
the converse is not true even if 7(.A) C B(H) in spite of Problem 33d, because it is not 
ensured that L # # unless dimH < oo. Irreducibility of representations in the following 
always means the topological irreducibility; in the case of representations by unbounded 
operators the definition contains additional requirements on the domains of operators from 
m(A) — cf. [BaR], Secs.11.1 and 16.5. 

The irreducibility criteria of Theorem 1 are standard — see, e.g., [BR 1], Prop.2.3.8 
or [ Di 2], Prop.2.3.1; we exclude the trivial case dimH = 1. The equivalence of condi- 
tions (a) and (b) is usually referred to as Schur’s lemma. On the other hand, in group 
representation theory this term often means the following implication ({ Ham], Sec.3.14; 
[ Boe ], Sec.I.7): if a set S of operators on a finite-dimensional vector space V has no 
nontrivial invariant subspace, then S’ = €(V) (see also Problem 33d); the same is true 
even if V has a countable algebraic dimension — cf. [ Kir}, Sec.8.2. The name is some- 
times also used for a more general result about intertwining operators between a pair of 
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representations of the same group or algebra — see [Ham], Sec.3.14; [Zel], Sec.20; [Kir], 
Sec.8.2. An analogy of the Schur lemma for real vector spaces of a countable dimension 
is given in [ Kir], Sec.8.2; a generalization to operator sets in a real Hilbert space can be 
found in [CM 1]. 

The conclusion about expressing each bounded operator by means of an irreducible set 
can be strengthened substantially in a finite-dimensional vector space V. The symmetry 
assumption is not required (after all, it makes sense only if V is equipped with an inner 
product) and the irreducibility condition can also be modified: if S C L(V) has no 
nontrivial invariant subspace, then any (linear) operator on V is a polynomial in the 
elements of S, t.e., Ao(S) = L(V) ; this result is usually called the Burnside theorem — 
see, e.g., [ Zel], Sec.21. 

‘The fact that functional calculus cannot, in general, be introduced for noncommuting 
operators does not mean that there are no reasonable definitions of functions for particular 
classes of noncommuting self-adjoint operators. A prime example is provided by operators 
P and @ for which one can use the Fourier transformation to introduce a family of their 
functions called pseudo-differential operators — see, e.g., | Hor]; they have recently been 
applied to semiclassical analysis of Schrédinger operators — cf. [Hel]. 


Section 6.8 The theory of unbounded operator algebras has two main sources of inspi- 
ration: the representation theory of Lie algebras and algebraic quantum field theory. We 
might add that it has not yet reached its maturity; the results were found only in jour- 
nal literature until recently the monograph [Schm] appeared covering the subject. The 
reader can find a much deeper treatment there; we also refer to the review papers [Bor 1], 
- [Las 3] and [ Vas]. The notion of Op*—algebra was introduced by G. Lassner (1972); 
properties of these algebras are discussed, e.g., in {Epi 1], {Las 1-3], {Pow 1], [VSH 1], 
and in the other papers quoted below. The set £+(D) is alternatively denoted as Cp — 
see [Epi 1], [ET 1-3] — and a separate term is sometimes introduced for Op*—algebras 
A in which (J+7+T)-! belongs to AMB(H) (compare the notes to Sec.6.1). Example 1 
represents a particular case of the algebra S Cc £+(S(JR")) generated by the restriction 
of the operators Q;, Py, j,k = 1,...,n, to the subspace SUR") C L?(IR”) ; the proof 
that S is closed and self-adjoint can be found in { Pow 1], Sec.5. 

Proof of Theorem 2 is given in [Las 2]. In addition to rp, there are other ways 
to equip A with a topological structure [Las 1-3]; all these topologies generalize the 
operator—norm topology on B(H) ; in general, however, they do not yield a continuous 
involution. Other possible topological structures are discussed, e.g., in [ArJ 1], [Schm 2]. 
In the cases when (D,7,) is a Fréchet space the domains D for closed Op*-algebras can 
be classified — see [LT 2]. 

While Op*-algebras in a sense represent an analogy to C*-algebras, some special 
classes of them have been studied with the aim of constructing an unbounded counterpart 
to the theory of W*-algebras — cf. [AJ 1], [ET 3]. On the other hand, there are 
structures which generalize the concept of an Op*-algebra; let us mention, for instance, 
the partial *-algebras discussed in [AK 1], [AIT 1], where the product is defined only 
for some pairs of operators. We encounter such objects in quantum mechanics, e.g., when, 
together with the operators Q, P of Example 1, we also consider H := (2m)~!P?+V(Q) 
which need not preserve S(JR). 

In Op*-algebras we may use matrix representations: an operator T € £t(D) ona 
separable H is uniquely determined by the numbers {(¢;,T¢«): j,k =1,2,...} for any 
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orthonormal basis {¢}32, C D — see [Epi 1], [ET 1]. Some results concerning ideals 
in B(H) can also be extended to £+(D). For instance, £+(D) contains several ideals 
which turn into K(H) in the case D = H ; two of them are such that the set F(D) of 
all finite-dimensional operators from £*+(D) is dense in them with respect to Tp — ef. 
{Tim 1]. In some cases one can prove that £L+(D) contains just one tp-closed ideal C ; 
it is also possible to construct a faithful representation of the factor algebra CLt(D)/C, 
which generalizes the Calkin result mentioned in the notes to Sec.6.4 — see [L6T 1]. The 
paper [Tim 2] describes a way of using the ideals in B(H), e.g., Jp, p 2 1, to construct 
ideals in Lt+(D). 

A more complete version of Theorem 3 and related results can be found in [Schm 1]. 
It is worth mentioning that this assertion is considerably weaker than Theorem 6.4.1 since 
in the case D = H condition (a) holds only if dim# < oo. Similar restrictions apply 
to other results on continuity of the functionals which admit a trace representation — ef. 
[LT 1]. On the other hand, an assertion analogous to Theorem 6.4.1 can be proven for a 
class of generalized W*-algebras mentioned above — see [ AJ 1]. 


Problems 


1. The relations ||a|] = supyey<y ||abl] = supyey=: |]ab|| = maxyej=1 ||ab|| are valid in any 
C*—algebra. 


2. Let A bea C*-algebra with the unit element. Prove 
(a) the relation a*a = e implies r(a) = 1. In addition, if the element a is unitary, 
then o4(a) C{AEC: |A| = 1}, 


(b) r(a) = |ja|| holds for any normal a € A, and moreover o4(a) C [—|la], |lall] 
provided a is Hermitean, 


(c) if p is a complex polynomial, then o4(p(a)) = p(a.4(a)) holds for any ae A. 


Hint: (a) Use Theorem B.2.4d. (b) If a = a*, |A| > |la|], consider the element 
(a+i|Ale)(a—i|Ale)—!. (c) Use the root decomposition of p(a)—Ae. 


3. Let A bea *-algebra with the unit element; then there is at most one norm which 
makes it a C*—algebra. 
Hint: Use the relation |lal| = r(a*a)!/2. 


4. The direct sum of C*-algebras Ag, a € J, with the norm ||[aa]|loo := suUPge_ |l@alla 
is a C*-algebra. 


5. To given C*~algebras Ai, Az there is at least one realization of their tensor prod- 
uct. If (A,@) and (A, @) are two such realizations, then there is a isometric 
«-isomorphism y: A -— A such that y(a})@p(a2) = (a; @ ag). 

Hint: Use Theorem 6.2.6. 


6. Prove the relations between the five topologies in Remark 6.3.1. 


7. The neighborhoods U,.(B;®) := {C : pe(C—B) <e} for all ® € H form a basis 
of the o-strong topology in B(H). Is the analogous assertion valid for the o-weak 
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8. 


10. 


Ul 


ie: 


13. 


Die 


15. 


16. 


topology? - 
Hint: Components of ®;, 7 = 1,...,n, can be arranged into a single vector ® € H. 


Prove: (a) Let S be a symmetric subset in B(H) ; then S’ is a W*-~algebra. 
Moreover, the W*-algebra Ay(S) generated by S equals S”, and Aw(S) = 
Aw((S)w). 

(b) Let E be a projection different from 0, I. The algebra A:={aE: ae} 
is weakly closed and A” = {aE+G(I-E): a,8EC}. 


(c) Given a set S C B(H) we define Gs := (Mes KerB)+ and denote the 
corresponding projection as Es. Then BEs = B holds forall BES; if S 
is symmetric we also have EsB= B. 


Hint: (b) Any B € (A)w is of the form B=aE,aeC. 


Prove the von Neumann density theorem: Any nondegenerate *-subalgebra A C 
B(H) is dense in A” with respect to any of the topologies Tw, Ts, Tow, Tos - 


Let A bea W*-algebra; then the relations (Ag)! =(A')g and Za, = (Za)k 
hold for any projection FE A. 


Let Ac B(H) bea W*-algebra. A projection E € A is said to be minimal if 
there are no nonzero projections F£,, H2 in A such that E = F\+£». In that case, 
||B@|| = || BE|| \|d|| holds for all 6 € EH, Be A. Any minimal projection in A is 
Abelian. 

Hint: Use Lemma 6.5.3. 


Let py: A—B bea *-morphism of W*-—algebras; then 


(a) if B € A is Hermitean (respectively, positive, a projection, unitary), the same 
is true for p(B), 
(b) ||y(B)|| < ||Bl| holds for any B € A; it turns into identity if y is injective, 


(c) if A € A is Hermitean and g is a continuous real-valued function, then 
9(9(A)) = 9(¢(A)); 
(d) (Za) = Zp holds provided ¢ is bijective. 


Hint: (b) Use Proposition 6.1.2. (c) Prove the assertion first for polynomials, then 
use Theorem 5.2.2. 


Prove Proposition 6.3.7. 


Suppose that AC B(H) is a W*-algebra whose center contains a complete system 
of projections {Ea : a€ J} (cf. Sec.3.2); then there is a spatial isomorphism 
between A and @gc7 Aka: 


Using Proposition 6.3.8 show that any discrete factor is homogeneous. 


The map 7: 7(B) =TrB is a faithful normal trace on B(H). 
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1p 


18. 


ao: 


20. 


27. 


22. 


29. 
24. 
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The assumption that W commutes with all E, in Theorem 6.4.7 is essential: with 
the exception of the trivial case, card J = 1, there are other statistical operators 
without this property which satisfy relation (6.5). 

Hint: Consider the algebra A:=C OC. 


For any o-weakly continuous linear functional on a W*-algebra A C B(H) there 
is a trace class operator W such that f(B) = Tr(WB) holds for all Be A. Ina 
similar way, generalize Theorem 6.4.7a. 

Hint: Modify the proof of Theorem 6.4.1. 


Suppose that 7 is a two-sided ideal in algebra B(H) on a separable H, and E, F 
are projections; then 

(a) 7” = {0} implies 7 = {0}, 

(b) if E,F have the same dimension and E € J, then also Fe J, 

(c) any projection in 7 is finite-dimensional, 

(d) if H>F and EE J, thenalso Fe 7. 
Hint: (a) Let B € J ; then the spectral decomposition of A := B*B contains a 


nonzero projection E,4(J) such that A} F4(J)H is invertible. (b) Use a suitable 
partial isometry. 


Prove Theorem 6.3.8. 

Hint: If J is such an ideal, the set 7” consists of all finite-dimensional projections. 
To prove K(H) C J C K(H), one can consider Hermitean operators only; use the 
argument from the preceding problem to show that E4(J) € J provided 0¢ J. 


Let H be a separable Hilbert space; then 


(a) algebra B(H) has no nontrivial two-sided ideal which is weakly (strongly, 
o-weakly, o-strongly) continuous, 


(b) algebra K(#1) has no nontrivial two-sided norm-closed ideal. 
Let S, S;, Sp be symmetric operator sets; then 


(a) any T€S is closable and Te S, 


(b) if the set S, \ S2 consists of closed operators only, it is symmetric; find an 
example showing that the closedness assumption is essential, 


(c) the union (intersection) of any family of symmetric sets is symmetric. 
A real subalgebra in the set of all Hermitean operators on a given H is commutative. 
Let Sp, Sp, Sy be the sets introduced in Section 6.5 to a given S C Loa(H) ; then 


(a) Sp is symmetric iff S is symmetric; in that case Sy is also symmetric. The 
set Sy can be symmetric even if S and Sp are not symmetric, 
(b) if any of the sets S, Sp, Sy is commutative, the other two are also commutative, 


(c) if S is symmetric, it is commutative iff the same is true for Sp, 
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25. 


26. 


oF. 


28. 


29. 


30. 


31. 


32. 


33. 


(d) S’ =(S,)! = (Sz)' = (Sp)’ ; if S is symmetric we also have S’ = (Sp)’. 


Given aset S C Ly sa(H) weconstruct S, in the following way: we keep all bounded 
operators and replace each unbounded self-adjoint A € S by A, := arctan A. Show 
that set S, has the same properties as S, in the preceding problem. 


Prove: (a) Let # be separable. Corollary 6.5.6 extends to the situation when S 
is a commutative symmetric subset in £,,(#), where the last named set is 
obtained by adding to B(H) all normal operators on H. 


(b) Suppose that A is a generating operator of aset S C Lb sa(H) ; then {A}! = S! 
and {A}/=S". 

(c) 8” > Upes{B}” holds for any set S C Ly,sa(H) while the opposite inclusion 
need not be valid. 


Let S= {A}N, be a commutative family of self-adjoint operators on a separable 
Hilbert space H. Using the notation introduced in Sec.6.6, show that P() € S” 
holds for all {k}n € Ky and },, P™ =I. 


The result of Example 6.6.3a can be extended: the sets {f(Q)} and {f(P)} are 
CSCO for any continuous strictly monotonic function f : JR — IR. What does the 
commutant {f(Q)}’ look like if the function f is continuous but only piecewise 
strictly monotonic? 
Hint: Cf. [Nas 1]. 
Define the operators Q;, 7 =1,2 on L?(IR?,d(u1 ® p2)), where the measures p; 


are supposed to be Borel, not necessarily finite, by (Q;w)(x1,%2) := 2j;(21, 22). 
Prove: 


(a) the set {Q1, Q2} is a CSCO, 
(b) Q: =Q,, @/2 and Q2=h®@Q,,. 


Hint: (a) Let @; be a cyclic vector for Q,, on L?(UR,dp;) ; show that dy2 : 
12(41, £2) = $1(21)¢2(%2) is cyclic for {Qi,Q2}. 


Show that the operators V,, defined in the proof of Theorem 6.6.5 obey the relations 
V,t (B® I2)Va = 6a6B for any B € B(Hi), and moreover, VaVgi = (Wg,-)Ya- 


The W*-algebras generated by symmetric sets S; C B(H;), j = 1,2, satisfy 
the relation Aw(S,) = Aw(S)) ® ((H2) and the analogous identity is valid for 
Aw(S»). In addition, if each of the sets S; contains the unit operator, then also 
Aw({A1 ® Ag: A; € S; }) = Aw(S1) @ Aw(S2) . 


Let A bea *-subalgebra in B(H) containing the unit operator. If A’¢ is dense 
in H for some vector ¢ € H, then also AG =H. 


Prove: (a) A symmetric operator set S is reducible iff it has a nontrivial closed 
invariant subspace G and EgD(T) C D(T) holds for any Te S. 

(b) Let S C Lysa(H) be a symmetric set. If any of the sets S, S”, Sp, Sp, S+ 
and S, (cf. Sec.6.5 and Problem 25) is irreducible, then the others are also 
irreducible. 
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(c) If L CH is an invariant for S C B(H), the same is true for L. 
(d) A nonsymmetric S C B(H) can have an invariant subspace even if S’ = €(H). 
Hint: (d) Consider the set of upper triangle matrices in B(€?). 


84. Check the claims made in Remark 6.7.4. 


Hint: Choose the basis in which L3 is diagonal and use the commutation relations 
between Pauli matrices. 


35. Let AC Lt(D) be an Op*-algebra. If D =H; then AC B(H) and the uniform 
topology coincides with the operator—norm topology in B(H). 


Chapter 7 


States and observables 


From now on we shall discuss how the theory of Hilbert-space operators explained 
in the previous chapters is used in the treatment of quantum systems. To begin 
with, we have to say something about states and observables; these notions play 
a fundamental role in any branch of physics but for quantum theory a thorough 
analysis of them is exceptionally important. 


7.1 Basic postulates 


A good manner in physics is to start from classical mechanics. A classical system 
of point particles is completely described at a given instant if we know the values of 
all generalized coordinates and momenta; this information serves at the same time 
as the initial condition for the solution of the equations of motion, which determine 
the time evolution of a state as a trajectory in phase space. The simplest example 
is a point particle whose motion is constrained to a line; the phase space is the 
plane JR?. A family of allowed trajectories is associated with any Hamiltonian, 
for instance, concentric circles correspond to the harmonic oscillator, H(q,p) := 
(2m)! (p?+m?w*q?) , with a proper choice of units. 

In classical physics we usually disregard the way in which a given system has 
achieved its moving state. Generally speaking, this is the result of its previous 
history, which may include the evolution governed by the appropriate dynamical laws 
as well as a sequence of operations performed intentionally by an experimentalist, 
or possibly a combination of both. In some physical disciplines the “spontaneous” 
way is the only one, or at least it is strongly preferred, say, in astrophysics or in 
geophysics where human-made states (for example, artificial quakes of the Earth’s 
crust) are rare. On the other hand, in the case of microscopic objects (elementary 
particles, atomic nuclei, etc.) we mostly deal with the preparation of a state in the 
true sense. 

The reason why the difference between the two ways is more than a technical 
matter is hidden in the measurements we use to determine the state. Classical 
physics supposes — quite rightly in its own domain — that the influence of the 
measuring process on the investigated object can be made arbitrarily small. This is 
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no longer true when we treat microscopic objects; the tools we use for measuring are 
macroscopic and experience tells us that their influence cannot be neglected. This 
fact itself need not represent an obstacle if we were able to formulate the dynamical 
laws governing the process of measurement. The trouble is that we do not know 
how to describe in a deterministic way the system consisting of a microscopic object 
and a measuring apparatus. Quantum theory provides, for instance, information 
about the behavior of an atom interacting with an electromagnetic field, but we 
are not able to predict which emulsion grain will blacken or which one of the Geiger 
counters placed in line with the Stern—Gerlach apparatus will click. We can compute 
and verify experimentally only the probabilities with which these events occur. 

The conditions under which a state is prepared have different impacts on the 
final result; according to that we can regard them as substantial or nonsubstantial. 
There is no general criterion, which would allow us to decide about the substan- 
tiality of a given condition; an analysis of a particular physical situation is always 
required. Another general feature of the state preparation is its replicability. This 
is not only easily realized when we deal with microscopic objects, but it also repre- 
sents a fundamental requirement, which must be fulfilled to verify the probabilistic 
predictions of the theory. To replicate a state means to prepare it according to the 
same prescription, with all the substantial conditions preserved. 

Hence we arrive at the formulation which is usually presented as the definition 
of a state: a state is a result of a sequence of physical manipulations with the 
system, which together form a preparation of this state. Two states coincide if all 
the substantial conditions of their preparation are identical. In a similar way we 
can formulate the “operational” definition of an observable (dynamical variable): 
we associate with it a suitable instrument (measuring apparatus), which displays 
(records) a measured value when we let it interact with the system. 

Of course, such a definition does not take into account the fact that the same 
physical quantity can be measured in substantially different ways. If we want to 
measure, for instance, the longitudinal momentum in a beam of charged particles 
produced by an accelerator (or more precisely, to find whether its value belongs 
to a chosen interval), we can use a magnetic separator, a pair of detectors with a 
delayed-coincidence link, or a differential Cherenkov counter provided the particles 
are relativistic. Hence using the above mentioned “operational” definitions one 
must keep in mind that there is no bijective relation between the observables and 
measuring instruments. Since measurement is an unavoidable part of the state 
preparation, the reproduction “under the identical substantial conditions” requires 
that the results of all the performed measurements should coincide irrespective of 
the experimental procedures used. On the other hand, in Section 7.5 below we shall 
discuss which collections of measurements have to give the same results in order to 
ensure that the states coincide. 


7.1.1 Remarks: (a) There is also no clear distinction between systems and states 
or families of states. For example, two photons and an electron—positron pair 
can be regarded as two states of the same system since (under the appropriate 
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kinematic condititions) one of these entities can evolve into the other. Similarly 
the pions 1+, 7°, m— can be understood as three isotopic states of a single 
particle; the reader will easily find other examples. 


(b) The substantial conditions of state preparation do not include — not only in the 
microscopic world — the time and place when and where the event occurred. 
E. Wigner has stressed that this observation represented historically the first 
invariance law, which enabled physics to be established as an exact science. 


(c) In addition to dynamical variables, which are always related to a particular 
system, there are other measurable quantities. The most prominent example 
is time, which is a universal parameter for all nonrelativistic systems. Some 
characteristics of external fields acting on the system may also appear in the 
theory as parameters; recall the Stark or Zeeman effect. Other examples 
are provided by various universal constants: the electron charge e, Planck’s 
constant fi, etc. 


After this introduction we are going to discuss the ways, in which the states and ob- 
servables are described in quantum theory. We begin with the following postulates: 


(Qla) a complex Hilbert space # called the state (Hilbert) space is associated 
with any quantum system, 


(Q1b) a ray, z.e., a one-dimensional subspace in H corresponds to any state of 
the system under consideration. 


The states described by rays are called pure; a more general concept of a state 
together with the appropriate modification of the postulate (Q1b) will be given in 
Section 7.3 below. Each ray WV is spanned by some unit vector 7% € WV ; in common 
parlance the difference between the two objects is often disregarded. Any two unit 
vectors ~,7' € W differ at most by a phase factor, wy’ =ew with ye R. 

Before we proceed, let us recall the considerations used to motivate the above 
postulates. They are based on an analysis of simple experiments, such as observation 
of the interference effect in polarized light understood as a beam of corpuscular 
photons, spin measurements by a pair of Stern-Gerlach devices of differing mutual 
orientation, etc. To explain the results, one has to adopt the following assumptions: 


(i) if we perform a measurement of a certain observable on a system, which is in a 
state WY, then as a consequence the state changes into one of a set {Uj}, 
which is determined by the observable. In this sense therefore any state of the 
system can be regarded as composed of the states {Vj}/,, 


(ii) the outcome of the measurement has a probabilistic character, i.e., one is gener- 
ally able to determine only the probability P(V,V,) with which the mentioned 
transition occurs. This probability becomes certainty if UY coincides with some 
W; ; then we have P(W;, V;) = dj% . 
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nevertheless ask whether quantum theory could not with the same success use a 
state space of another structure. To answer this question, we must first axiomatize 
the properties of the measurements and then discuss whether the postulates (Qla) 
and (Q1b) represent the only possible way to satisfy these conditions. We shall treat 
this problem in Chapter 13 where we shall see that, for example, the choice of a 
complex Hilbert space is to certain extent arbitrary. 

In quantum mechanics we usually add another requirement of a mathematical 
nature: we suppose that 


(Qla) the state space of a quantum mechanical system has the structure of a 
complex separable Hilbert space. 


In fact, the class of separable Hilbert spaces is rich enough not only for quantum 
mechanics but also for most problems of quantum field theory and statistical physics. 
The convenience of the assumption is that it allows us to use countable orthonormal 
bases, and morever, we know that some of the results of the previous chapters were 
derived under the separability assumption. There is a heuristic argument supporting 
the strengthened version of the postulate; we shall mention it in Remark 11.1.1b. 

Let us turn now to a description of the observables. We add a new postulate to 
the list, namely 


(Qic) a self-adjoint operator on the state space is associated with any observable 
of the system. 


As in the case of the state space, we postpone the discussion of the ways in which this 
correspondence is realized. To motivate the postulate, we shall again use a simple 
system with N independent states VY; , which can arise as a result of measuring an 
observable A. We know that the rays {W;}_, are mutually orthogonal, and we also 
have the sequence {a;}j_, of real numbers representing possible outcomes of the 
measurement; this uniquely determines a self-adjoint operator A such that W; is 
for any j the eigenspace corresponding to the eigenvalue a;. The postulate (Q1c) 
extends this correspondence to all observables including those with a nonempty 
continuous spectrum. We have to say more, however, about the results we can 
obtain when a particular observable is measured. 


7.1.2 Example: The Hilbert space €? describes, e.g., electron spin states. The 
spin projections at the j-th axis correspond to the operators S; := 305, where 
o; are the Pauli matrices (in fact, S; := $ho; ; for the sake of simplicity we put 
ht = 1 as we shall almost everywhere in the following). Each of them has a simple 
spectrum, o(S;) = {4,—4} and its spectral decomposition is S; = 4E\— LEO, 
where i.e := 4([+0,;) (Problem 1). Suppose now we are measuring the quantity 
S; on an electron whose state is represented by a unit vector x € €?. Experience 
tells us that 


(a) the eigenvalues +} of the operator S; are the only possible outcome of the 
measurement, 
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(b) the probability of finding any particular one of the spin component values is 
+ 
ws = w({4h}, S5x) = (x, BY), 


(c) for the mean value (S;), := 4w4 — jw-, we then easily get (Sj), = (x, Sjx). 


Similar conclusions can be made for other simple systems; motivated by them 
we can extend our list of postulates. To simplify the notation, we shall use mostly 
the same symbol for an observable and for the operator associated with it. Suppose 
therefore we have a system with the state space # and an observable described by 
a self-adjoint operator A ; we denote the si aaa projection-valued measure 
and spectral commune as Ei, and {ae hay respectively. If the system is in a 
state YW determined by a unit vector = € H, then 


(Q2a) the possible outcome of measuring the observable A will be the points of 
the spectrum o(A) of the operator A, 


(Q2b) the probability of finding the measured value in a Borel set AC FR is 
w(A, Ais) = f ad, BY) = NECA? 


The assertion (c) in the example has followed from (b). In the same sense the 
postulate (Q2b) implies that the mean value of the measurement results is given by 


the relation 
(A)y = (y, Ay) (7.2) 


provided the right side of this relation makes sense; we shall prove this in a more 
general context in Section 7.3 below. It is clear that the quantity w(A,A;qw) in 
(Q2b) does not depend on the choice of the vector ~ representing the state V. We 
must also check that it can indeed be interpreted as a probability. In the Kolmogorov 
axiomatic approach, a probability measure is a (non-negative, o—-additive) measure 
on a o-algebra of events which is normalized, 7.e., the measure of the whole g-al- 
gebra is equal to one. These requirements are satisfied because w(-,A;7) is a Borel 
measure according to Proposition 5.1.1¢c and w(o(A), A;~) = 1; an event A here 
means that measurement result is contained in the set A. 

The measurement of spin components discussed in Example 2 represents the sim- 
plest nontrivial (dichotomic) case of a measurement: there are only two mutually 
exclusive results. Measurements of that type are often called a yes-no experiment. 
Almost all actual measurements, of course, have a more complicated structure; the 
importance of yes~no experiments stems from the fact that the measurement of any 
observable can, at least in principle, be regarded as a collection of yes-no experi- 
ments. One can always divide the scale of the apparatus into intervals and ask: will 
the measured value be found in a given interval — yes or no ? The most illustrative 
example of this equivalence is provided by various multichannel-analyzer devices 
frequently used in experimental physics. 

The self-adjoint operator corresponding to a yes—no experiment has by postulate 
(Q2a) just two different eigenvalues a, € IR. Without loss of generality we may 
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suppose that these values are a = 1 (yes, the positive result) and G = 0 (no, 
the negative result); hence each yes—no experiment is associated with a projection 
onto some subspace in #1. This makes the identification of an observable with a 
collection of yes—no experiments more transparent. If a projection P, corresponds 
to the yes—no experiment “Is the measured value of the observable A contained in 
the set A ?”, then we have w({1}, Pa;w) = w(A, A;w). Since Ep, ({1}) = Pa 
the postulate (Q2b) gives 


(x, Paw) = (¥, E4(A)y) 


for any vector 7% to which some state of the system corresponds. If this is true for 
all ~» € H we obtain Py = E,(A) for any Borel set A C JR. We shall see in 
Section 7.4 that, in general, there are vectors which are associated with no state, 
but the conclusion will remain valid because we shall at the same time adopt a 
restriction on the set of admissible observables. Projections corresponding to the 
mentioned yes—no experiments are therefore nothing else than the values of the 
spectral measure E,4. 

We already know what kinds of mathematical objects are ascribed to the states 
and observables, and what the predictions of measurement results can look like; 
it remains to answer the question of what happens to the state of the system, 
which has suffered a measurement. In the heuristic consideration presented above 
we have stated the answer for the case of an observable whose spectrum is pure 
point and simple. In the general situation we use equivalence with the collection of 
yes—no experiments and formulate the answer for the observable associated with the 
spectral projection F(A) as an additional postulate: 


(Q3) if the result of the experiment is positive (the measured value is contained in 
the set A), the system after the measurement will be in the state described by 
the unit vector E4(A)~/||E4(A)?|| ; in the opposite case we have to replace 
E,(A) on E,UR\ A) =I-E,(A). 


This makes sense, of course, only if the norm appearing in the denominator is 
nonzero; but this condition causes no trouble. For instance, ||F4(A)#|| = 0 
holds iff the probability of finding the value in A is zero (provided the system 
is in the state WV); in this case therefore we get a negative result with certainty, 
||E4UR \ A)w|| = 1, and the state of the system after the measurement will be de- 
scribed again by the vector ~. The yes-no experiment under consideration can 
also, however, be regarded as a filter, z.e., one can take into account only those cases 
when the measurement has given a positive result; the assumption ||F4(A)q|| = 0 
then means that the filter is closed for the system in the state Y and it naturally 
makes no sense to speak about what will happen after the passage through it. 

The postulate (Q3) has the following important consequence: if a yes—no ex- 
periment yields, e.g., a positive result, then the same result will be found if the 
experiment is repeated immediately. Indeed, the vector ob’ = E4(A)w/||E4(A)y|| 
is reproduced by E4(A), so the probability w({1}, F4(A); ~’) = 1 by the postulate 
(Q2b). This is also in agreement with experience; one should be aware, however, 
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of the indefiniteness hidden in the expression “immediately repeated”. In an ac- 
tual experiment we have to pay attention whether the change of state between the 
two measurements may be neglected; this question can be answered for a particular 
system only from a knowledge of its dynamics. 

We have motivated the postulates by simple arguments about observables with 
pure point spectra; the conclusions are easily seen to be valid for any observable of 
this type. 


7.1.3 Example: Let an observable A be described by a self-adjoint operator on a 
separable +H whose spectrum is pure point, (A) = 0,(A) with op)(A) = {Aj : j= 
1,2,...}. Denote the projections onto the eigenspaces by P; := Ea({A;}). If the 
state before the measurement is described by a unit vector ~ € 7, then 


(a) the possible outcomes of the measurement are the eigenvalues A; , 7 = 1,2,..., 
(b) the probability of finding the value A; is w({A;}, A;v) = (W, iv) = |B ll?, 


(c) if we find the value A;, the state after the measurement is described by the 
vector Pj~p/|| Piel . 


We assume implicitly that the device can distinguish any two eigenvalues of the 
operator A; it is not difficult to modify the result for the case of an apparatus with 
limited resolution. It is important to keep in mind that this differs substantially 
from the situation, where the device does measure a particular value from a set 
A := {,,.--,An} but we register only that the result lies within A ; a more 
detailed discussion will be given in Section 7.5. 


An attentive reader may object to the assertion (a) of the above example that 
in view of the postulate (Q2a) the measurement can in addition yield values, which 
belong to the closure of o,(A) but not to the set itself. The answer is that the differ- 
ence has no measurable consequences: due to Proposition 5.4.1b E,4({A}) =0 holds 
for any A € o(A) \ 0,(A), and therefore w({A}, A;z) = 0. A different conclusion 
would be strange anyhow, because no measurement can distiguish “arbitrarily close” 
quantities (see also the notes). Let us finally mention two other simple consequences 
of the postulates. 


7.1.4 Example: Given a state ® described by a unit vector ¢, consider the yes—no 
experiment “Is the system in the state @ ?”. Let Eg denote the corresponding pro- 
jection. The quantity w({1},£s;~) is nothing else than the transition probability 
(7.1); hence we have ||Ez{|? = |(¢,27)|? for any state W. If a state of the system 
corresponds to any nonzero 7 € H, the last relation implies Es = (¢,:)¢, i.e., Ey 
is the projection onto the ray ® in H. The same is true even if the assumption is 
not valid, as we shall see in Section 7.4. 


7.1.5 Example: Consider a pair of Borel sets A,,A2 € IR. The probability 
w(A; M Ao, A;~) can be expressed through a pair of consecutive measurements. 
Suppose, for example, that we have first performed the yes-no experiment E,4(A)) : 
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with the probability ||£4(A1)2]||? we find the positive result and the state after the 
measurement will be described by #’ = E4(A,)w/||E4(A1)y||. Next, we perform 
the yes—no experiment E’4(A2), where the probability of the positive outcome is 


|| E4(A2)E4(A1)y|/? 
||Z4(A1)¥||? 


The two described measurements clearly represent independent events; hence the 
joint probability of finding both positive results can be expressed in the product 
form, w(Az, Ai, A; v) =a w(Ag, A; v')w(A1, A; w) = ||Ea(A2)Ea(A1)¥|/, and it 
follows from Proposition 5.1.1 that it equals w(A,;M A>, A;~). Interchanging the 
order of the yes—no experiments, we get 


|Ea(Ae)o"||? = 


w(AiN Ae, A;y) = w(Ae,Ai, A; ~) = w(Ad, Ag, A; #) 


and also the state after the complete measurement is in both cases given by the same 
vector, namely E4(A; 9 Ag)y/||E4(A1 N Ag)y||. Hence the yes~no experiment 
E4(A; 1 Ag) is equivalent to the experiments E,4(A;),7 = 1,2, performed in 
any order if the positive result of Fi4(A;M Ag) means a positive result in the two 
measurements; this conclusion easily extends to any finite number of measurements 
(cf. Problem 12). 


7.2 Simple examples 


Before we proceed let us mention several elementary systems known from the intro- 
ductory chapters of quantum mechanics textbooks. Apart from the cases when the 
state Hilbert space is finite-dimensional, the simplest situation occurs if H = L?(JR) 
or a subspace of it corresponding to an interval in JR. In reality, state Hilbert spaces 
have a more complicated structure, but we can often get a useful one-dimensional 
model by separation of variables, as we shall see in Section 11.5. 


7.2.1 Example (a spinless particle on line): The state Hilbert space is H = L?(JR) 
and the fundamental dynamical variables of the “one-dimensional” particle are its 
position and momentum represented by operators @ and P from Examples 4.1.3 
and 4.2.5, respectively; a motivation for this choice will be discussed in Section 8.2. 

Consider first the position operator Q. We know that its spectrum is purely 
continuous, o(Q) = JR, so any real number can be a result of position measurement. 
Suppose that the state of the particle is described by a unit vector % € L?(UUR) (it 
is customary to speak about a wave function if H is a functional space); then the 
postulate (Q2b) in combination with Example 5.5.1a gives 


w(A,Q;¥) = f W@)Pae (7.3) 


for any Lebesgue measurable A C JR, in particular, for an arbitrary Borel subset of 
the real line. Since the probability measure w(-,Q;~w) is generated by the function 
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|(-)|?, and the Lebesgue measure, we usually say that |y(x)|? is the probability 
density of finding the particle at a point x. If the particle is found in a set A, its 
state after measurement is given by the vector 


(Eq(A)y)(z) _ _xa(z)o(z) 


vee) =" eames 


where yq is the characteristic function of the set A. The operator P can be treated 
using unitary equivalence (4.4): we have P = F-'QF where F is the Fourier- 
Plancherel operator. By Examples 5.3.11c and 5.5.1b, the probability of finding 
the particle momentum in a Borel set A is equal to w(A, P;#) = ||Eo(A)Fy|\*. 
This expression can be rewritten in the form w(A,P;~) = f, |o(k)/?dk, where 
@ := Fw can be computed as in Example 3.1.6. If the wave function in addition 
belongs to L'(JR), the formula simplifies to the Fourier transformation, ¢(k) = 
(27)-1/? fr e-**h(x) dx , and we arrive at the textbook result. 

Another important observable for a one-dimamiaaa particle (of a mass m) is 
its kinetic energy described by the operator Ho := =P?. a Proposition 5.5.3, 
it has a purely continuous spectrum, o(Ho) = age + ; the probability of 
finding the kinetic-energy value in a set A is easily seen to — 


w(A, Hos) = fi. 1O(k Pak, 


where h(k) := k?/2m. If the particle is free (noninteracting) then operator Ho 
also, of course, describes its total energy. 


7.2.2 Example (a particle on halfline): The state space is in this case H := 
L?UR+). The position operator Q : (QYv)(xz) = zy(x) with the domain D(Q) := 
{pb : fo? x?|b(zx)|*?dx < 00} is self-adjoint and the conclusions of the previous 
example modify easily for it. In distinction to it (and to the situation in classical 
mechanics), there is no momentum observable for a particle on halfline: we know 
from Example 4.2.5 that there is no self-adjoint operator corresponding to the formal 
expression —id/dz. 

The most interesting case is the operator of kinetic energy, which can be defined 
but in a nonunique way. For simplicity we neglect the numerical factor, i.e., we 
put m = 1/2. In Example 4.9.6 we have shown that there is a one-parameter 
family of operators corresponding to the expression —d*/dx? , namely Hoe := T(c) 
with the domain D(c) := {% € AC?(0,00) : w/(0) — ceW(0) = 0} for ce R and 
D(oo) := {wv € AC?(0,00) : Y(0) = 0}. The spectrum o(Ho,) D (0,00) , where 
inclusion turns into identity for c > 0, while in the case c < 0 we have an additional 
eigenvalue —c* corresponding to the eigenvector ¢. : (x) = /—2ce™ 

How are we to understand these facts ? We shall present a heuristic argument 
(which can nevertheless be made rigorous — see the notes). For any k € IR and 
€ > 0 we define the vector dye: bke(x) = (e7**4+ R-e***) e-€*" | where the coefficient 
R, is chosen in such a way that ¢,,¢ € D(c), i.e., 


ik+e+c 


RR := —————_.. 
‘ ik—e€—c¢ 
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Since s-lime_o(Ho,< — k*)@x,. = 0 we may regard the function ¢x as a generalized 
eigenvector of Ho,- corresponding to eigenvalue k? belonging to the continuous 
spectrum (compare to Corollary 4.3.5). It is not normalized, of course, but it shows 
how the wave functions behave in the vicinity of the point z = 0. The expression 
op(xz) = e~** + Re*** with R:= Ro can be understood as the superposition of the 
incident wave and the reflected wave whose phase is changed on arg R; for each c 
this quantity is a different function of k (or of the energy variable k?). In addition, 
in the case c < 0 the particle can exist in a state bound to the “barrier”, which is 
described by the vector ¢.. Hence we see that different self-adjoint extensions of 
the formal Hamiltonian —d?/dx? describe different physical situations. 


‘7.2.3 Example: Next, we shall mention the operators describing the position and 
momentum of a real free spinless particle whose state space is H := L?(R°). To 
the three Cartesian components of the position vector x = (x1,22,2%3) we ascribe 
the operators Q; : (Q;~)(x) = 2;%(z), 7 = 1,2,3, with the domains D(Q;) = 
{vy € LUR®): fps 23|\b(x)|?dx < oo}. By Examples 4.3.3 and 4.3.7, these are 
self-adjoint with the purely continuous spectra, o(Q;) = JR, and 


(EQ,(A)Y)(x) = xa(zs)h(2) 


holds for any Borel set A Cc JR ; from here it is easy to express the probabilities 
w(A,Q;;~) and the state of the particle after such a measurement by analogy with 
Example 1. Recall also that the Q; are related to the operator Q of this example 
by the relations Q; = Q @1@ 1, etc., which can be checked as in Problem 6.29. 
In a similar way, we associate with the Cartesian components of the momentum 
the operators P;, 7 = 1,2,3, defined by P,) = P@I@/, etc.; they are self-adjoint 
by Theorem 5.7.2. We shall check that the operators Q,; and P; are again unitarily 
equivalent, 
P; = F5'Q;Fs, j=1,2,3, (7.4) 


by means of the Fourier—Plancherel operator F3. Using Proposition 4.5.6 and Prob- 
lem 4.34, we find the inclusion P, C F;'Q,F3, and the relation Q = FPF7! 
implies in the same way Q, C F3P,F;'. Since inclusions between operators are 
preserved by unitary equivalence, we get the relation (7.4); the proof for 7 = 2,3 is 
analogous. 

Momentum-component operators have the domains D(P;) = F;1D(Q;). How- 
ever, the way in which we have defined them specifies their action directly only 
for some vectors, namely those of the form ~(x) = Dy) Tv” (21), where 
yi) € D(P;) and y\" © L?(R) for the other two values of the index | differ- 
ent from 7 ; the definition of the unbounded-operator tensor product then yields 


(Pyles (32) ey (78) 


This formula is, of course, also valid for other vectors; we shall check that it 
holds for all 2 € S(UR®). By Example 1.5.6, F;' maps S(JR*) onto itself, 
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so S(IR’) C D(Q;) implies SUR?) C D(P;). Furthermore, we have (x) = 
(2m) 3/2 fra e*** (F3eh)(k) dk for any y € S(R*). The integrated function is ma- 
jorized by |(Q;F3%)(-)| independently of x ; hence the integration and differentia- 
tion may be interchanged, 


J (34) (1) = (2n)-9? fe *hj(Fav)(k) dk = (Fs1Q)F)(a) = (Pv)(@), 


which is what we set out to prove. Since S(JR*) is a core for P;, the relation (7.5) 
can be used as an equivalent definition of the momentum-—component operators (cf. 
Problem 5 and the notes). Notice finally that the same conclusions can be drawn 
for the operators Q;, P; on L?(JR") for any positive integer n. If n = 3N, for 
instance, then such operators describe the positions and momenta coordinates in a 
system of N spinless particles. 


One of the most important observables is the total energy of the system; the 
corresponding operator is conventionally called Hamiltonian. We have already 
encountered free—particle Hamiltonians in the first two examples; now we want to 
mention two other simple systems. 


7.2.4 Example (harmonic oscillator): Consider again the state space H := L?(JR). 


The operator 
H: (Hd)(y) = —¥"(y) + 9d) 


with the domain D(H) :={y~ € H: y,y"' absolutely continuous, fp| — v’(y) + 
y’~(y)|?dy < oo} is up to the dimensional factor $hw identical with the harmonic- 
oscillator Hamiltonian 

fic et pet 22 


Amy = -——— aa t+ 
= ime 
if we set y:= ,/%* 2. It is well known that H has a simple spectrum consisting 
of the eigenvalues A = 2n+1, n = 0,1,2,..., and the corresponding eigenvectors 


are the Hermite functions (2.1) which form the orthonormal basis €y in L?(JR). It 
follows from Example 4.2.2 that H is e.s.a. on the linear envelope of Ey as well 
as, e.g.,on SUR). 

In addition, H is self-adjoint due to Proposition 4.8.9b; we shall show that it 
can be expressed as H = P?+Q*. We easily get P?+ Q? C H ; to check the 
opposite inclusion we use the fact that S(R) is invariant with respect to both P? 
and @*, and the relation [P,Q"|~ = —inQ""'W holds for any 7 € S(JR). It yields 


P?+ Q?)ol? = ((P?+ Q)*p, p) 
((P*+ Q*+ 2PQ?P + [P, [P,Q?]])¥, ¥) 
= P*pl? + lQ?pll? + 2QPx|)? — 2Ip|?, 


and therefore the inequality 


Il 


(P?+ Q?) ell? + Ql)? > |PPa||? + Q?pI)?. (7.6) 
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Since S(JR) is a core for H, we can find to an arbitrary ¢ € D(H) a sequence 
{¢n} C SUR) such that ¢, — ¢ and Hd, — Hd. Furthermore, one has Hd, = 
(P?+ Q*)¢n so the inequality (7.6) can be rewritten as 


| 1(¢n—Gm)|* + 2Il(ba—dm)||? = P?(bn—bm) I? + 1Q?(n—Gm) |]? - 


It follows that the sequences {P?¢,} and {Q*¢,} are Cauchy, and since both 
operators are closed we find that ¢ belongs to D(P?)M D(Q?), i.e., the inclusion 
D(H) c D(P?) N D(Q?) which concludes the proof. 


7.2.5 Example (rectangular potential well): This time we consider the operator 


H: (Hy)(x) = —p"(z) + V(x) (z) 


with the domain D(H) := D(P?), where V := —Voxi-a,q} for some positive Vo, a. 
The operator V of multiplication by the function V coincides by Example 5.2.8 
with V(Q) ; in the present case it is bounded, ||V|| = Vo. Hence we can write 
H = P?4+V. Up to the factor h?/2m, this is the Hamiltonian of a particle 
interacting with the rectangular—well shaped potential; the operators Ho := P? and 
V correspond to its kinetic and potential energy, respectively. The boundedness of 
the potential V implies further that H is below bounded, (#, Hy) > —Voll#||?,, so 
o(H) Cc [—Vo, 00). 

Let us investigate the spectrum in more detail. The point part of it is simple and 
well known: there is a finite number of eigenvalues, all of them contained in the inter- 
val (—Vo,0). If we introduce k := /E+Vo and x := /—E for E € (—Vo,0), then 


the eigenvalues E,, := —«? are given by the solutions to equations ka tan(ka) = Ka 
and ka cot(ka) = —«a ; these are conventionally numbered starting from the small- 
est, En, < Enyi, n=1,2,.... The respective eigenvectors are 
Cy, sin (kn — %) .. |al<a 
tn: Un(z) = 
C,(sgnz)"*" sin (kna _ a) ee) Te .. Nall era 


where the normalization factor is Cy := \/Kn/(1+akn). There are just N eigen- 
values, where N is given by the inequalities (N—1)mr < 2/Voa? < Nx. Recall 
that the eigenvalues are found in such a way that we match the solutions of the 
appropriate differential equations in the three intervals so that the resulting function 
is square integrable and continuous together with its first derivative at the points 
x = +a. In other words, we choose among the candidates for the role of eigenvector 
those functions, which belong to the domain of the Hamiltonian. Similarly one can 
check that H has no eigenvalue outside the interval (—Vo, 0). 

Since the eigenvalues are simple and have no accumulation point, the remaining 
points of the spectrum belong to o¢..(H). It is easy to ascertain that the spectrum 
contains any point situated above the well, 7.e., o(H) > IR* (Problem 8). To check 
that the points of the interval [—Vo,0) with the exception of the eigenvalues do not 
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belong to the spectrum, we employ Theorem 5.4.6. We know from Example 1 that 
ong?) =F 5x80 

od) = Ces(H) = 
provided the operator V is P?-compact. The resolvent (P?+ *)~' is bounded 
for any « > 0, and by Problem 9, V(P?+ «?)~' is an integral operator with 
the kernel 1 V(x)e~"“?l. This function is square integrable so V(P?+ %?)~? is 
Hilbert-Schmidt according to Theorem 3.6.5, and therefore compact. 


7.3. Mixed states 


Let us now return to analysis of the formalism. The next thing we have to discuss is 
the postulate (Q1b) because the correspondence it describes is not general enough 
in its present form. 


7.3.1 Example: Suppose that an electron whose spin state is determined by a 
unit vector y € €? passes though the Stern-Gerlach apparatus (i.e., a slit with 
a nonhomogeneous magnetic field followed by a pair of detectors Ds) adjusted 
along the j-th axis, and that after the passage the electrons are again brought to a 
single beam by means of a magnetic collimator. We disregard the other dynamical 
variables of the electron as well as the fact that hydrogen atoms rather than single 
electrons, in fact, pass through the device. If one of the detectors Dx clicks, it 
happens by Example 7.1.2 with the probability ws := |e” x ||? and the electrons 


leave the collimator in one of the states $4 := a ‘gi Eo x||. Assume that next 
we measure on them, for instance, the k-th spin component. This can be done in 
different ways: 


(a) we choose, e.g., the electrons whose j-th spin component has been determined 
to be +4, while those in the state ¢_ are allowed to go unregistered. The 
probability of finding the value $, where a = +1, is then 


wo? = w({S}, Side) = be Hl). 


On the other hand, if we decide not to register the electrons in the state $4 , 
then we get the value 2 with the probability w\ = (@_, EO 6.). 


(b) another possibility is to cease distinguishing between the electrons and to mea- 
sure S, on each electron that has passed through the first device. In that case 
the probability of finding the value ¢ is equal to We := ww) + w_wl). 
The vectors @4, ¢- form an orthonormal basis in C? ; thus substituting for 


w*) we can rewrite the last expression as 
tig = Tr(EOW), 


where W := w, B+ wee ; notice that the operator W is positive and 
its trace equals one, 7.¢., it is a statistical operator. This allows us to compute 


7.3. MIXED STATES 265 


the probability w. without knowing in which particular state the electron had 
been after the first measurement; it is sufficient to know that any electron was 
in some of these states, together with the probability of this event. 


Let us stress that the measurement described in part (b) differs, of course, from 
the situation when the detectors are not switched on; in that case the state of the 
electron after the passage through the first apparatus is described again by the vector 
x. This can easily be checked experimentally: choosing, for example, 7 = 3, k = 1, 
and the electron state x before the first measurement such that Six = 4x, we find 


tia = } while w({}, Six) Sn 


Measurements of type (b) occur frequently in actual experiments. This moti- 
vates us to extend the notion of a state. The states described by rays in the state 
space will be called pure. More generally, we shall associate with a state a sta- 
tistical operator W := });w,;E; and interpret it in such a way that the system is 
with probability w,; in the pure state ®; := E;H. Pure states obviously represent 
a particular case in which some w,; is equal to one. The other states for which 
all w; < 1 are called mixed. The respective postulate can be now formulated as 
follows: 


(Qib) a statistical operator (density matrix) on the state space is associated with 
any state of the system. 


In combination with the definition of a mixed state, this implies 


(Q2b) the probability that measuring an observable A on the system in a state 
W we find a value contained in a Borel set A C JR equals 


w(A, A;W) = Tr(E4(A)W), 


and therefore we replace the postulate (Q2b) of Section 7.1 by this condition; it is 
clear that if W isa pure state, W = Fy, , we return back to the previous formulation. 
Mixed states are important from the practical point of view. We know that the 
probabilistic prediction of quantum theory can be verified only by performing the 
same measurement on a large number of identical copies of the system. In an actual 
experiment, however, it is usually technically impossible to achieve the situation in 
which all elements of such a family would be in a given pure state; we know mostly 
that particular copies are in some states from a subset in the state space together 
with the corresponding probabilities. In that case it is useful to assume that they 
are all in the same mixed state described by the appropriate statistical operator. 


7.3.2 Example (polarization density matrix): We want to find the general form of 
a statistical operator on C?. Any Hermitean operator is there described by a real 
linear combination of the matrices I, a1, 02, 03,. The condition TrW =1 implies 


W = -(I[+€-o), 


1 
2 
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where €-0 := D#_,60;, | € IR, because Tro; = 0. Moreover, the operator 


W must be positive. Its eigenvalues are easily found, w = (1 + |€|) where 
¢| := (3_, €?)!? ; this means that W is a statistical operator iff |€| <1. If we 
measure the j-th component of the spin, the mean value of the results is 


(Sj)w = Tr(WS;) = 54: 


The vector €, usually called polarization, is thus the doubled mean value of the 
spin. The density matrix W can describe, for instance, the spin state of an electron 
beam produced by an accelerator. If || = 1, we have W* = W so the state W 
is pure (Problem 10). The opposite extreme occurs when € = 0; then W = $1 
and all outcomes of spin measurements are equally probable. Such a density matrix 
corresponds to an unpolarized beam. 


To accept the postulates formulated above, we have to check that w(-, A;W) 
of (Q2b) is indeed a probability measure on B. At the same time we would like to 
generalize to the present situation the expression (7.2) for mean values. This is the 
content of the following assertion. 


7.3.3 Theorem: Suppose that A is self-adjoint and W is a statistical operator 
on #; then 


(a) the map w(-, A; W) of (Q2b) is a probability measure on the o-algebra B ; it is 
identical with the Lebesgue-Stieltjes measure ph? generated by the function 
fir): fi Q) = Te (BXW), 


(b) the mean value of (the results of measuring) the observable A in the state W 
is given by 


(Alp sex “ Adu (A) = Tr(AW) = Tr(WA) (7.7) 
provided the right side makes sense, i.e., AW € Ti. 


7.3.4 Remark: For the sake of illustrativeness, the integrals with respect to the 
mentioned measure are often written in a more explicit form, e.g., 


(A)w = I Xr dTr (EM w) . 


Notice that the right side can also exist in some cases when Tr(AW) makes no 
sense; for example, if the state is pure, W = Ey, then Tr(AW) = (w, Av) is 
defined for ~ € D(A) while for the existence of the integral ~ € D(\A|!/?) is 
sufficient. 


Proof of Theorem 3: By Theorem 3.6.7 E4(A)W € J; , so the trace of this operator 
makes sense. The spectral decomposition of W is of the form W = »; wjE; , where 
E; are the one-dimensional projections corresponding to normalized eigenfunctions 
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$3 : Wo; = wjd;, J = 1,2,.... The set function w(-,A;W) defined in (Q2b) then 
satisfies the relation 


w(A) = w(A, A;W) = Df wyj(A) < 1 


for any A € B, where yp; = (¢;, Ea(-)¢;) are the measures referring to the pure 
states @;. We obviously have w(@) = 0 and w(JR) = 1; it remains to check the 
o-additivity. Let A = Uf2, Ax be a disjoint union of Borel sets; then the last 
formula gives w(A) = 0; wyhy (Ux Ak) = Dj wz (Xe My (An)). The elements of the 
series on the right side are non-negative so we may rearrange it, 


v(a) = 5 (Emylan)) = Detar. 


Hence w(-,A;W) is a probability measure on B. The function rise is clearly 
nondecreasing and bounded; we shall check that it is right-continuous. Using the 
spectral decomposition of W , we find 


fw (n) — fi Q) = 2 ws(Os, (Ey — E EX”)95) 


for any pair of real numbers 7 > . The series on the right side is majorized by 
>; w; independently of 1, so the summation may be interchanged with the limit 
n — A+ ; the properties of the pestis measure then imply lim,,+ fA (9) = 
f{P (a). Using the ings of fh , we easily get wT ) = w(J) for any interval 
J C R, and therefore uy AA) = = w(A) for any A € B according to Theorem A.2.4; 
this concludes the proof of part (a). 

The operator A is self-adjoint and W is Hermitean, so (WA)* = A*W* = 
AW. Since J, is a *-ideal in B(H), it contains the operator WA = (W.A)* 
and Tr(WA) = Tr((WA)*) = Tr(AW). Next, we use the decomposition W = 
+; wjE;. The operator AW € J is everywhere defined; hence ¢; € D(A) if 
w; > 0. The quantity 


Tr (AW) = »(¢;, AW 45) = >) w;(¢;,4¢;) 


{3: w;>0} 


is real; this proves the second identity in (b). To check the first one, it is enough to 
show that the condition AW € J implies the existence of the integrals fy: |A| du (r d) 
and the relations 


fe IA dp (a) = 41 (E4(IR*) AW). 


Let {sn} be a nondecreasing sequence of non-negative simple functions on JRt+ 
such that limpoo$n(z) = x for all x > 0, and furthermore, f,° Adu (A) = 
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litmaesallge Sn (A) adp\)(d) (cf. the remark following Proposition A.2.2 and Theo- 
rem A.3.2). For any simple function s:= 1;¢;xa, we get from the already proven 
assertion (a) 


* 9(a) dua) = 2 Tr (Ea(A,)W) = Tr(s(A)W) = 2 ws(5»9(A)ds) 
Q j 


so the integral equals limpoo /; w;(¢;, 8n(A)¢;). We know that ¢; € D(A) if 
w; > 0; this implies the existence of the integral f>° A du;(A) = (¢;, EaUIR*)AQ;) - 
At the same time, the inequality s,(A) < A gives 


(65, Ea(R*)Ads) = (03, 5n(A)d,) =f 3n(A) ds(A) 


for all n and 7, so limpoo(;, 8n(A)¢;) = (¢;, Ea(IR*)Ad;) by the monotone con- 
vergence theorem. The above estimate also shows that the sum )); w;($;, 8n(A)d;) 
can be majorized independently of n, so the summation may be interchanged with 
the limit. We have therefore obtained the sought relation for the interval IR* ; the 
proof of the other one is analogous. § 


The postulate (Q3) readily implies in which state the system will be after the 
measurement when originally it was in a mixed state. We again formulate the result 
for the yes-no experiment F4(A) ; a more general situation will be discussed in 
Section 7.5: 


(Q3) suppose that the system before measurement is in a state W . If the result is 
positive (the measured value is contained in the set A), then the state after 
measurement is described by the statistical operator 


wy! = EACAYWEA(A) 

Tr (Ea(A)W) 
We can speak about the state W’ only when the probability w(A, A;W) is nonzero; 
in that case the right side makes sense and W’ is again a statistical operator (Prob- 
lem 11). Also the consequence of the postulate (Q3) mentioned in Section 7.1 
remains valid: if the yes-no experiment E4(A) is repeated immediately, we get the 
same result with certainty. 


7.4 Superselection rules 


The postulate (Qib) does not tell us whether any ray in the state Hilbert space 
H of a given system corresponds to some state. Similarly we can ask whether any 
self-adjoint operator on # describes an observable. The answer to both questions 
is, in general, negative. 


7.4.1 Example: Proton and neutron can be regarded as two isotopic states of a 
single particle — a nucleon; the corresponding state Hilbert space is €?. If we put 
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the proton charge e = 1, then the charge operator is represented by the matrix 
Q := $(03 + 1), and its eigenstates Up : Quy, = % and Yn: Qn = 0 describe 
the proton and the neutron state, respectively. Up to this point, the situation is the 
same as in Example 7.1.2. There is, however, a substantial difference between the 
two cases: experimental experience tells us that 


(a) no realizable state of the nucleon corresponds to a nontrivial superposition of 
the proton and the neutron state, 7.e., toa vector ~ = at,»+6y, with nonzero 
coefficients a, G, 


(b) any observable A commutes with the charge operator Q = ( : ) ; hence it 
can be expressed as A = \, ES" + )_E{ , where ES) a = he Oe 


The second claim follows from the first. Had an observable B := pF, + pop 
noncommuting with Q existed, then its one-dimensional spectral projections could 
not commute with @ either. A measurement of B would produce the nucleon in a 
state described by of the eigenvectors of B ; however, the latter are not contained 
in the rays ES”) C2. 


Consider now a general quantum system with the state Hilbert space H. The 
example shows that # can contain unit vectors to which no state of the system 
corresponds. Denote by F the set of all vectors to which some state does correspond. 
Without loss of generality, we may assume that F is total in H. This follows from 
the postulate (Q3) according to which we have F'4(A)F Cc F for any observable A 
and a Borel set A, and therefore also E,4(A)Hr C Hr where Hp := Fin. Let Ep 
be the projection to the subspace Hr ; then the vector E,4(A)Ery belongs to Hr 
for any w EH, i.e. E,(A)Ery = EpE4(A)Epry. Hence the projections E4(A) 
and Er commute for all A € B, and by the spectral theorem, the observable 
A is reduced by the subspace Hr. It is clear from the postulate (Q2b) that no 
information can be obtained experimentally about the part of A in H#, because 
all vectors corresponding to realizable states of the system are contained in Hp, 
and therefore we may put H = Hr. 

A set M C F is said to be coherent if there are no nonempty orthogonal sets 
M,, M2 such that M = M, U ™ ; if, in addition, M is not a proper subset of 
another coherent set we call it a mazimal coherent set. The closed subspace Min 
spanned by a maximal coherent set M is called a coherent subspace. 


7.4.2 Proposition: Any state Hilbert space contains a family {H.: a € J} 
of mutually orthogonal coherent subspaces such that H = D®.) Ha. 
Proof: Since the set F' is total by assumption, it is sufficient to find the decomposi- 
tion F = Usey Fa , where {F,} is a family of mutually orthogonal maximal coherent 
sets. The sought orthogonal-sum decomposition then holds with Ha := (Fa)un } 
this follows from the inclusions F C (®.;Ha C H and the condition F+ = {0}. 
To get the sets F, we introduce on F' the following equivalence relation: ¢ ~ 
if there is a coherent set M Cc F’, which contains both of them. The reflexivity 
and symmetry of the relation are obvious, so we have to check its transitivity. If 
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d~ w and y ~ x, there are coherent sets M,, Mz such that ¢,y € M and 
w,x € Mz ; if M := M,U Mp was not coherent we could express it as a union of 
nonempty orthogonal sets N,, N2. Suppose, for instance, that py € N, so M,N, 
is nonempty. At the same time it is orthogonal to M,M N2, and since M, = (Mn 
N,) U(M, No) is coherent, we have M, N2 = 0. The coherence of M2 implies 
in the same way M2MN2 = 0, so together we get No = (M1 N2)U(M2M No) = 0 
in contradiction to the assumption. Hence the set M is coherent and ¢~ x. 

We now use the described relation and identify the sets F, with the equivalence 
classes. We must first check that they are mutually orthogonal. If the vectors 
¢,y € F are nonorthogonal, the point set {¢,~} is coherent so ¢ ~ w. By 
negation, if ¢,~ belong to different equivalence classes they must be orthogonal. 
The sets Fy are coherent by definition; we have to show that they are maximal. 
Let N > F, be acoherent set; then N = NU Fy where No := Ugga(N O Fe). 
The orthogonality of the family {F, : a € J} implies N. 1 Fa, and since N is 
coherent we conclude that Na =@, i.e., F, is maximal coherent. J 


Hence the state Hilbert space # of any quantum system decomposes into an 
orthogonal sum of coherent subspaces. In particular, if H is separable the index 
set J is at most countable. Any vector ~ € F belongs to just one of the subspaces 
H. ; in other words if ¢ € H has nonzero projections to at least two coherent 
subspaces, it does not describe a realizable state of the system. 

Proposition 2 says nothing about which vectors within a particular coherent 
subspace correspond to realizable states. It follows from experimental experience 
that F, are subspaces in H ; for the sake of simplicity we assume, in addition, that 
they are closed, i.e., 


He = HaNF =F, aed; (7.8) 


we shall return to discussion of this assumption a little later. A system is called 
coherent if its state space is coherent, H = F. 


7.4.3 Remark: Up to now we have been discussing pure realizable states. Con- 
sider a mixed state described by the statistical operator W := );wj;E;. The 
projections E; correspond to pure states; hence if the state W has to be realizable 
the corresponding rays must be contained in some of the coherent subspaces. The 
assumption (7.8) yields the opposite implication: if W is reduced by all coherent 
subspaces, then any pure state contained in the mixture is realizable. Together we 
find that a statistical operator W describes a realizable state of the system iff it is 
reduced by all the coherent subspaces. 


The symbol © will denote the family of all observables of a given system. An 
important subset in it consists of bounded observables corresponding to Hermitean 
operators on B(H) ; we denote this by O,. We shall also use the W*-algebra 
A := Aw(O;) generated by the bounded observables. For simplicity we refer to this 
as to the algebra of observables of the system; we have to keep in mind, of course, 
that not every element of A is associated with an observable. 
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7.4.4 Theorem: Suppose the state space decomposes as in Proposition 2. Let 
assumption (7.8) be valid; then 


(a) the algebra of observables is reduced by all the coherent subspaces; if the index 
set J is at most countable we have 


ACID BUHL), (7.9) 


acl 


(b) if one of the sets O, O,, and A is irreducible, the same is true for the other 
two; in that case the system is coherent, 


(c) any self-adjoint operator associated with some observable is reduced by all the 
coherent subspaces. 


Proof: In order to prove the (a), it is sufficient due to Proposition 6.3.6 to check 
that each projection EF € A is reduced by all the projections E, corresponding to 
the coherent subspaces. Suppose that there is a projection & € A and a€ J such 
that FE does not commute with E,. Then we can choose a unit vector w € Ha 
such that both F,Ey and (I—E,)Ew are nonzero. To prove this claim assume 
first that (I—E,)E¢=0 for all 6 € Hy. Then we have (I—F,)EE, = 0 and at 
the same time, E,E(I[—E,) = ((I-E,)EE.)* =0,ie, EE, = E,EE, = EE, 
but the projections £, E, do not commute by assumption. Hence there is a unit 
vector % € Ha such that (I-F.)F.w #0. If LE, Ey = 0, then we would have 
(I-E,)Ew = Ey, i.e., Ep € Ht. In that case, however, ||Ey||? = (%, Ey) = 0 
so Ey = 0, which is impossible. 

According to the assumption (7.8) the vector 7% corresponds to a realizable 
state of the system. The projection E belongs to the algebra of observables; the 
probability of a positive result in the yes-no experiment F is |/Ew||? #4 0 and 
the state after the measurement is then described by the vector ~’ := Ey/||Ey|l. 
However, the latter has nonzero orthogonal components in both H, and Hi, and 
therefore no realizable state of the system can be ascribed to it; this concludes the 
proof of assertion (a). 

Next we shall check that if any of the sets O, Oy, A is irreducible, so also are 
the other two. It follows from Problem 6.8a that A = Of and so A’ = QO}. In 
Section 7.1 we identified the measurement of any observable A with the family 
of yes-no experiments €4 := {E,4(A): A € B}, so Ae O means E, Cc Oj. 
Using the notation of Section 6.5 we can write this as the identity O, = O, ; then 
Problem 6.24 gives A’ = O}, = O’ and the sought result follows from Schur’s lemma. 
If the sets under consideration are irreducible, we have A’ = ('(H) and therefore 
A = A’ = B(H), which means in view of the part (a) that the system is coherent. 
Finally, using once again the equivalence of the observable A with the family &, 
together with the spectral theorem we obtain assertion (c). & 


The assertion (c) of the theorem can be used to check whether a given system is 
coherent. If we find among the operators representing the observables of the system 
an irreducible subset, the set O is itself irreducible and the system is coherent. 
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7.4.5 Examples: (a) A particle on line: it is sufficient to take the position and 
momentum operators; the set {Q, P} is by Example 6.7.2e irreducible, so a 
one-dimensional particle represents a coherent system and A= B(L?(R)). 


(b) A system of spinless particles: Consider now the operators Q;, Pj, j = 
1,...,n, on L?(R") — see Example 7.2.3. The preceding example in com- 
bination with Theorem 6.7.3 shows that the set {Qi,.--,Qn,Pi,-.-, Pa} is 
irreducible; hence the corresponding system (for instance, a system of N spin- 
less particles for n = 3N ) is coherent and A = B(L?(JR")). 


We use the term superselection rules for the restriction to the set of admis- 
sible states, which is represented by the decomposition of the state space into the 
orthogonal sum of coherent subspaces. These rules are usually determined by a 
particular family of observables; the latter are sometimes called superselection op- 
erators. We have mentioned in Example 1 that the electric charge belongs to this 
family; other examples are the baryon number or the integrity /half-integrity of the 
spin. The superselection operators are usually of the form A := YocyAaEa, Le., 
they have pure point spectra and the corresponding eigenspaces are the coherent 
subspaces H,. All such operators commute mutually; this fact is usually referred 
to as commutativity of the superselection rules. 

Let us now return to assumption (7.8) about the closedness of the set of states. If 
we want to use it, we have to accept that the mean values of some important observ- 
able — for example, energy — may not be defined for some states. Unfortunately, 
this sometimes happens to states which are physically interesting. 


7.4.6 Example (Breit-Wigner formula): In Section 9.6 we shall discuss how un- 
stable systems can be treated in quantum theory. In the simplest approximation we 
associate with such a system the state 7%, such that the projection of the spectral 
measure of the Hamiltonian onto the corresponding one-dimensional subspace has 
the following form 


iF dr 
du, Eby) = = 

(te Hi" Wu) large aca 
where Ao, I are constants characteristic of the considered decay process. The mean 
value (H)y, makes no sense because the integral fp |A|d(u, Ey.) does not 
converge. 


The mean value of energy, of course, does exist for all pure states which are 
represented by vectors from D(H) (and obey the superselection rules). More 
generally, one can introduce the notion of a finite-energy state W as such that 
the integral fp |A|dTr (Ew) converges; the mean value (H)w for these states 
is given by the relation (7.7). In particular, the set of pure finite-energy states is 
nothing else than the form domain Q(H) of the Hamiltonian. 

One might therefore attempt to replace assumption (7.8) ky the requirement 
that the admissible (pure) states correspond to the dense subset F, := H.M Q(A) 
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in each one of the coherent subspaces. However, in that case a question arises 
whether the “true” physical states should not also exhibit finite mean values of other 
important observables such as positions and momenta coordinates, etc. Fortunately, 
the difference between such conjectures and assumption (7.8) has no measurable 
consequences. The reason is that experimentally we determine only the probabilities 
of the postulate (Q2b) and not the mean values directly; thus the convergence of 
integrals like (7.7) is a matter of our extrapolation. 

To illustrate this claim in more detail, let us introduce the set B(H) of bounded- 
energy states: it includes all W such such that the measure w(-,H;W) has a 
compact support, in other words, w(JR \ A,,H;W) = 0 for some interval A, := 
(—b,b). Any W e€ B(H) is obviously a finite-energy state; pure states of B(H) 
are described by analytic vectors of the Hamiltonian. We shall show that B(H) is 
dense in the set of all states in the trace-norm topology. 


7.4.7 Proposition: For any state W there is a one-parameter family {W,} C 
B(H) such that lim,... Tr|W-—W,| = 0. 

Proof: Given a statistical operator W we set Wy := MEW E,, where & := 
Ey(Ay) and N;* := Tr(E,W) ; the definition makes sense for all 6 larger than 
some by > 0 since limy_,...N;’ = 1 due to Problem 3.41. We employ the estimate 

Tr|IW-W| < Tr|IW-EW|+Tr|e(W-WE,)| + |1—N| Tr |E.W E,| 
< 2Tr|W-EW\+N -1. 
By the polar—-decomposition theorem, there exists a partial isometry U such that 


|W-—E,W| = U*(I[—E,)W. We express the trace by means of the basis {¢;} 
consisting of the eigenvectors of the operator W ; this yields 


Tr |W-E,W| = 3° w;(Ud;, I-Ee)b;) < D> w;l|(1- a) 95 - 
j j 


The series on the right side can be majorized independently of b ; the sought result 
is then obtained using the relation s-lim.(J-f)=0. & 


Experimentally we cannot decide (even in principle) whether a given state has 
bounded energy, because any actual energy measurement tells us that the system 
is in a state W such that for the yes-no experiments Ey(A;,), positive numbers 
ée, k=1,...,n, and some W© € J,(H) the inequalities 


[Tr (En(Ar(W-W))| < e& 


are valid. Proposition 7 and Problem 3.40b show that lim,.. Tr (A(W—W,)) = 0 
holds for any bounded observable A ; hence for all b large enough the states W 
and W, cannot be experimentally distinguished. 


7.5 Compatibility 


We know that a measurement changes the state of the system. If several measure- 
ments are performed successively, the result may depend on the order in which they 
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are done; hence it generally makes no sense to speak about a simultaneous observa- 
tion of the corresponding dynamical variables. In some cases, however, the order is 
irrelevant. Observables A,, A are said to be compatible if 


w(Ao, Ag} Arg Ai; W) = w(Aj, Aj; Ao, Ag; W) 


holds for any state W of the system and arbitrary Borel sets A,,42 C RR; more 
generally, the observables of a family { Ag: @ € I}, where J is any index set, are 
compatible if any two of them are mutually compatible. We again implicitly assume 
that the two mesurements follow immediately one after the other, so the change of 
state in the meantime can be neglected. 


7.5.1 Proposition: The observables {Ag : 3 € I} are compatible if the cor- 
responding self-adjoint operators form a commutative set. For each finite subset 
{ Ag,,.--,Ag, } we have 


aw( Aves Ag,} ees A, Agz,} W) => 21) Agguarys Bipiges: Gta s Ang, Ages, W) ; 


where 7 is any permutation of the set {1,...,n}. The compatibility is reflexive 
and symmetric but not transitive. 

Proof: It is evident from the definition of a commutative operator set that it is 
sufficient to check the equivalence for a pair of operators A,, Ag. If they commute, 
the projections E'4,(A;), E4,(A2) also commute for any Borel sets A,,A. Cc R; 
the corresponding probabilities then coincide by Problem 12. To check the necessary 
condition it is enough to consider pure states W := Ey only. The relation between 
the probabilities then becomes (7, FE’Ew) = (wy, E’EE'w), where we write for 
brevity E := Ey,(A;) and E’ := E,,(Ae2). This is valid by assumption for all 
realizable states, i.e., for any vector ~ contained in some of the coherent subspaces 
E,H. The projections E, FE’ commute by Theorem 7.4.4 with all E,, and the 
same is true for the operator C := EE’E— E'EE"'. The condition can be rewritten 
as (E.¢, CE.) = (¢,CCE.¢) = 0 for any ¢ € H, which readily implies C = 0. 
Denote further B := EE’'— E'E ; then C =0 gives B*B =0, and therefore B = 0 
according to identity (3.5). Hence it follows from the compatibility assumption that 
the projections E4,(4,) and E,4,(A2) commute for any Borel A,, A, € R, ie., 
the commutativity of A,, Ao. 

The reflexivity and symmetry are obvious from the definition. Any observable 
is compatible with the trivial one represented by the unit operator. Hence if the 
compatibility were transitive, the observable algebra would have to be Abelian; it 
is easy to check that this is not the case with the exception of the trivial case when 
each coherent subspace is one-dimensional. 


7.5.2 Example: Consider the operators Q; of Example 7.2.3. By Proposition 5.7.3 
they commute, which means that the Cartesian coordinates are compatible. The 
same is true for operators P; : it follows from (7.4) that the Cartesian coordinates of 
the momentum are also compatible. Both conclusions are confirmed by experience; 
they allow us, in particular, to speak about measuring positions or momenta in a 
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system of N particles. On the other hand, the Cartesian coordinates are incompat- 
ible with the corresponding momentum coordinates because the operators Q; and 
P; do not commute. This has important consequences which we shall discuss in the 
next chapter. 


According to Proposition 5.1.9 we are able to associate with commuting self- 
adjoint operators A,,...,A, a unique projection—-valued measure E on JR” such 
tiem (ex -- xwAy)o= Bags)... Bag An) forany BorelWAy,...4A, CIR. 
Given a state W we can then define the map w: B” — [0,1] by 


w(A, {Aj,...,An};W) := Tr(E(A)W) (7.10) 


for any A € B". We can extend to it a part of the assertion of Theorem 7.3.3; it 
is sufficient to realize that the corresponding proof does not employ the fact that 
Ea(-) is a projection—valued measure on R. 


7.5.3 Proposition: The map w(-,{Ai,...,An};W) defined by (7.10) is a proba- 
bility measure on JR” for any compatible observables Aj,..., 


Now we want to know how this measure is related to the probability of find- 
ing the result of measuring A ,,...,A, regarded as the n-tuple of real numbers 
{Ai,---;An } inaset A C JR”. In the case when A := A, x--- x A, is an interval 
in JR" the sought probability is Tr(E(A)W) according to Problem 12. At the 
same time we know that there is no other Borel measure with this property; if two 
Borel measures coincide on intervals in JR” they are identical by Theorem A.2.4. 
This argument is the motivation behind the natural generalization of the postulates, 
which is again confirmed by experimental evidence: 


(Q2b) the probability of finding the result of a simultaneous measurement of com- 
patible observables A,...,An in aset A € B” when the system is in a state 
W equals Tr (E(A)W), i.e., it is expressed by the relation (7.10) regarded as 
an identity, 


(Q3) the state after such a measurement is described (in case of the positive result) 
by the statistical operator (Tr(E(A)WE(A)))E(A)WE(A). 


The extension covers situations when the measurement cannot be reduced to a finite 
sequence of elementary acts, for instance, ascertaining the presence of a particle in 
a spherical volume. 


7.5.4 Examples: (a) Consider the operators Q; of the preceding example. The 
measure FE is easily found: we have E(A) = xa(Q), where Q stands for 


{Q1,---;Qn}, so Example 5.5.8 gives (E(A)w)(x) = xa(x)¥(x). In particu- 
lar, if the system is in a pure state V we get from here 


w(A,Q50) = f W(a)Pa 
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for any set A € B". This represents a natural extension to (7.3); recall that 
this formula historically played an important role in constituting quantum 
mechanics (the statistical interpretation of the wave function postulated by 
M. Born, P. Dirac and P. Jordan in 1926). 


(b) A particle with nonzero spin: Spin has a double meaning: i the triplet 


of observables S;, 7 = 1,2,3, and secondly, a number s = 0,2 ie oo . The 
state space of such a yartiele is H := L?UR?; C+") ; due to (2.6) it can be 
expressed as H = L?(JR*) @ €*%+!, where the spaces PUR?) and ‘C=! .eor- 
respond to the configuration and spin states of the particle, respectively. The 
operators S; are usually defined by means of the orthonormal basis {Xm}i=-s 
of the eigenvectors of $3, 


(Si £iS2)xXm = ¥(sFm)(stM+1)xXme1, S3xXm = Mm; (7.11) 


we can easily check (Problem 15) that they are Hermitean and satisfy the 
relations 


(Sj, Ss] = tgeS;, S*:= SPHS+ Sp = ser, (7.12) 


where I, is the unit operator on @?%+! and we have adopted the stan- 
dard convention according to which one sums over the repeated indices in 
the commutation relations. The spectra of these operators coincide, o(S;) = 
{-s,-s+1,...,s}. 

Dynamical variables such as position, momentum, spin, etc., are, however, 
ascribed to the particle as a single entity, and therefore operators on the total 
state space H should correspond to them. They are of the form 


Q, = Q;@1,, P; = P@l,, 8; = I, @ Sj, 


where j = 1,2,3 and /, is the unit operator on L?(JR*) (we shall use the 
underlined symbols only if it is necessary to stress the relations to the operators 
on the “component” spaces). According to Proposition 5.7.3, they are self- 
adjoint and o(Q,) = 0(P;) = R while o(S;) = {=s,-—s + 1,...,s}. In 
view of the above mentioned relations no two of the operators S; commute, 
but all of them commute with @, and P, for k = 1,2,3. a the 
commutativity of Q;, Qx implies Sheet the operators Q, , k = 1,2,3, commute 
mutually, and the same conclusion can be drawn for Zz, ,k =1,2,3. Hence we 
can choose for a set of compatible observables one of the spin components (or 
a real linear combination of them) together with the components of position 
or momentum; most frequently one uses the sets 


= {Q,,Q,,Q5)Ss ts ps = { BdtoaBowSian.. 


Notice that a := I,@ S? = s(s+1)/ ; this provides an example of a particular 
observable represented by a multiple of the unit operator. The measurement 
of spin squared can yield only the value s(s+1) ; this gives us right to speak 
about a particle with spin s. 
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Let us now return to the problem of how the state of the system is changed after 
measurement. In the above formulation of the postulate (Q3) we have assumed that 
we register the results of all performed measurements. However, this is not the most 
general case. 


7.5.5 Example: Suppose that we perform a sequence of compatible yes—no exper- 
iments E := {F,,...,E,} on the system in a state W ; the result is an ordered 
n-tuple r := {r1,...,7}, where the numbers r; assume the values 0,1. To be 
able to express the probabilities w(r) := w(r, E;W) we introduce the projections 


Te 


E(r) — [](4;4,-,; “f (I—E;)6o,,) : 
j=l 
Obviously, E(r)E(r’) =0 for r#r’ , and furthermore, ¥,cy E(r) = I where the 
sum runs over the set M of all the 2” different n—tuples. 

Up to now we have just a particular case of the measure E which is supported 
here by the set M ; the probability w(r) is then given by Tr(E(r)W) and the state 
after the measurement is described by the operator w(r)-!E(r)WE(r). Suppose 
now that we have registered only a part of the results, i.e., that we do not distinguish 
the states of a subset Myeg C M. Examples: 


(i) we register only the result 7; of the first experiment E; ; then Mreg := {r: 
r= Ty } ; 


(ii) if we remember only that the result contained the number one k-times, then 
Moogaeit: r) +:-2ttea=k}, 


(iii) if we have registered the result completely, then M,., consists of a single 
r-tuple. Conversely, if we have registered nothing one has M,.. = M. 


The state of the system after such a measurement is the mixture of the states 
w(r) 1 E(r)WE(r) for all r€ M,eg with the weights w(r) ; in other words 


W=N > E(r)WE(r), 


r€Mreg 


where the normalization factor is given by N~* := Dyem,., w(r) = Tr (E(Mreg)W) ; 
the orthogonality of the projections E(r) implies that E(Myeg) = Lrem,., H(r) is 
a projection. In the particular case when we register nothing the state is described 
by Wy := Drem E(r)WE(r). It is easy to check that 


Wee E(Myeg) Wu E(Mreg) 
Tr (E(Mreg)Wx) 
This means that the considered measurement may be regarded as consisting of two 


operations: first we let the system pass through the device without registering the 
results and afterwards we perform the yes—no experiment E(M;e,). Notice that 
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these results simplify if the yes-no experiments F,,...,E, in question are disjoint 
(Problem 14b). 


Functional relations between observables are not a self-evident matter in quan- 
tum theory. In classical physics the relation B = f(A ;,...,An) between dynamical 
variables has a clear meaning: if the Aj assume the values a;, then the quantity 
B assumes the value f(a1,...,@n). Compatible observables are important, in par- 
ticular, because they make it possible to define similar relations. In the quantum 
case, however, we are allowed to speak only about probabilities of the measure- 
ment outcomes. We shall thus formulate the definition as follows: let {A ,..., An} 
be compatible observables and f : JR" — R a Borel function; then the relation 
B= f(Aj,...,An) means that 


w(A, B;W) = w(f-(A), {Ai,...,4n}; W) (7.13) 


holds for any state W and an arbitrary Borel set A Cc JR, where B, Aj,... An are 
the self-adjoint operators corresponding to the observables under consideration. The 
operator A := f(Aj,...,An) is self-adjoint by the definition given in Section 5.5; 
we shall check that B= A. 

According to assumption (7.13), (~, Eg(A)w) = (4, E(f-P(A))wv) holds for 
any pure realizable state UY, where E is the projection—valued measure correspond- 
ing to the operators A;,...,A,. Both projections are reduced by all coherent sub- 
spaces; it readily implies Eg(A) = E(f‘-)(A)) for any A € B. On the other hand, 
we have E,4(A) = E(f'-(A)) due to Proposition 5.2.12, i.e., Ea(J) = Eg(J) for 
any interval J Cc JR, and therefore the two operators coincide. We have proven 
in this way that B = f(A ],...,An) as defined above implies the same functional 
relation between the corresponding self-adjoint operators. Hence we shall employ 
the same symbols again in the following for both the observables and the respective 
operators. Using in addition Theorem 5.5.10 and Problem 5.33c, we arrive at the 
following conclusion. 


7.5.6 Proposition: Suppose that Ai,...,An are compatible observables and f : 
JR” — R a Borel function; then the observable A := f(Aj,..., An) is compatible 
with Aj,...,An. If the function is continuous, we have o(A) = f(Ai,...,An)- 


7.5.7 Remarks: (a) The above argument can be reversed. If we have obser- 
vables represented by the commuting operators A;,...,An, then the relation 
B= f(A,,...,An) defines a new observable which satisfies (7.13). Its inter- 
pretation is clear: we measure it by the same device as the compatible family 
Aj,...,An but change the scale: the point A on it corresponds to the points 
of f-)(X) on the original scale. 


(b) Postulate (Q2a) means implicitly that the measured values are real numbers. 
In practice we often meet measurable quantities assuming complex values — 
let us mention just the example of the scattering operator which we shall 
discuss in Chapter 15. They are usually represented by complex functions of 
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commuting self-adjoint operators so no problems arise: we proceed as above 
using complex numbers to scale the apparatus. It is therefore useful to regard 
complex Borel functions of “real” observables as observables in a broader sense; 
by Theorem 5.2.11 they are described by normal operators. 


7.5.8 Example: Cartesian coordinates of momentum are compatible by Example 2. 
The kinetic energy of a spinless particle of mass m is described by a function of 
them, Ho := h(P), where h(p) := (2m)~!(p7+p3+p3) ; according to Example 5.2.10 
and the functional—calculus rules, this is equivalent to 


1 
| Ho = = P? = ai thts). 
Since the function h is real-valued, Hp is self-adjoint and Proposition 6 gives 
o(Ho) = IR*. Moreover, it follows from the equivalence (7.4) that Hy = Fy 'T),Fs, 
where 7; is the operator of multiplication by h. This yields the expression for the 
domain, D(Ho) = F;'Dp, where Dp := { € L?(IR?): fips x*|b(z)|?dx < 00}. In 
addition, using Examples 5.3.11c and 5.4.9b we get 


Foo) = CR in caso) = Gees(Ho)= a( Ao) =r. (7.14) 


Another way to write the operator Hp is with the Laplacian: the relation (7.5) 
together with P;S(IR*) Cc S(IR*) imply 


at mt) (ct) (7.18) 


(Hop) a) = — =—(A¥)(a) = - 5 (3 met oe toe 


for all ~ € S(JR°). This is true also for other vectors from D(Ho) ; what is 
important is that the operator ~(2m)~'A is e.s.a. on S(JR*) (Problem 16). 

Let us mention one more property of the operator Ho, namely that its domain 
D(Ho) c L®(UR%) , and for any a > 0 one can find a 6 such that 


IIPlloo < al] Hol] + bllyll (7.16) 


holds for all ¢% € D(Ho), where |j- || = || - ll2 is the usual norm in L?(JR®). To 
check this inequality, consider an arbitrary vector @ := F3y € D,. The identity 
@ = (1+h)-1(1+h) together with the fact that the functions (1+h)7! and (1+h)¢ 
belong to L?(JR?) imply ¢ € L'(R) ; hence w = Fy‘ belongs to L°(R°) and 
(27)3/2 ||| 0 < ||dl|,.. Furthermore, the Schwarz inequality gives 


IIPlloo < el](L+Th)Oll < c(llTrdll + Il9ll), 


where c := ||(1+h)71||. Next we define for any r > 0 the scaled function ¢, : 
r(x) = r°d(rx) ; the relations ||¢,||? = r°||Ol|? and ||Th¢-||? = r~"||Th@ ||? show 
that it belongs to D,. Substituting ¢, for @ in the last inequality, and using (7.4) 
together with the unitarity of F3 we get 


(2)? |[tblloo S z Tal] + or? Holl = Se Howl + or 
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which yields (7.16) if we choose r := c*/a?(2m)*. Notice that while the results from 
the first part of the example extend easily to the n-dimensional case, the inequality 
(7.16) does not hold for n > 4 (see the notes). 


Let S be a family of compatible observables. Due to Proposition 6 it is possible 
to append to it functions of the observables from S ; it may happen that this will 
exhaust all dynamical variables compatible with S, i.e., that we can find no other 
“independent” compatible observables. In such a case we call S a complete set 
of compatible observables. It is clear from Proposition 1 and Section 6.6 that 
operators representing the elements of S then form a complete set of commuting 
operators; we shall use the shorthand CSCO for both families. It is natural to ask 
whether a set of compatible observables can always be completed to a CSCO. 


7.5.9 Theorem: For any family of compatible observables there is a CSCO Swaz 
which contains it, Smaz DS. 


Proof: The operator set S is by assumption commutative and symmetric, and 
according to Problem 6.24 the same is true for Sy ; it follows that S” = S; 
is an Abelian +*-subalgebra in the algebra A of observables of the given system. 
The Zorn lemma readily implies the existence of a maximal Abelian *-subalgebra 
BCA which contains S”; the inclusion S” C B then yields S’ D B’. Using the 
notation of Section 6.5, we construct to B the set Br for which (Br)’ = B’ in view 
of Problem 6.24. At the same time, we have B = B’ = B” by Proposition B.1.2e 
because B is maximal Abelian, and therefore (Br)’ = (Br)” so Br isa CSCO. The 
inclusion S C Br need not be valid if S contains unbounded operators. However, 
one can replace Br by the commutative set Smar := SUBR C Loa(H) in which S 
is contained. We have S/,,, = S’N (Br)’, and since B’ = B” we get Si, = Say} 
this concludes the proof. §j 


In fact, the set Smer constructed in the proof is unnecessarily large; one can 
even get a CSCO by adding a single Hermitean operator to S provided H is 
separable (Problem 20). Stated in that way the result has only an abstract meaning; 
nevertheless in practice we always look for a CSCO consisting of a small number of 
particular observables. 


7.5.10 Example: In view of Example 4a and Corollary 6.6.6 each of the sets 
{Qi,.--,Qn} and {P,,...,P,} forms a CSCO for a spinless particle (if n = 3) or 
a system of such particles. Similarly any of the sets S,, and S,, from Example 4b 
is a CSCO for a particle of spin s. 


Complete sets of compatible observables play an important role in preparation 
of the state. Measurements included in this procedure will be summarily called 
a preparatory measurement. According to postulate (Q3) its outcome depends on 
the state before the measurement which we, however, do not know. One can get 
rid of this dependence provided the observables whose values are determined in the 
preparatory measurement form a CSCO. 

Consider first a CSCO S := {Aj,..., An} consisting of observables with pure 
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point spectra; for simplicity we assume that the state space H is separable. We 
shall use the notation introduced in Section 6.6: the probability of finding the 
N-tuple of values Ay := {oid me} for any {k} equals w(Ai,},S;W) = 
Tr (P.;W), and the state after the measurement is described by the statistical 
operator W’ := (Tr(Pi.}W))"'Pu;W Pre}. Recall that all the projections Pr} 
are one-dimensional; this enables us to simplify these relations choosing a unit 
vector ~%,%} in each one-dimensional subspace. In particular, the state after the 
measurement becomes 


r_ __(e,") 
- (deny, Weeay) 


so W’ = P,,. Hence we arrive at the following conclusion. 


Pry Woy = (ays) Pe » 


7.5.11 Proposition: Suppose that # is separable and S := {Aj,...,An} isa 
complete set of co mae: les with pure point spectra. If the ‘iegattt tient 
yields the values ye sa OL , then the state after the measurement is described 
by the corresponding common eee {x} independently of the state in which 
the system had been before the measurement. 


It is also obvious that we can in this way, at least in principle, obtain all inde- 
pendent states of the system because the vectors #4} form an orthonormal basis 
in H. The situation is more complicated if the CSCO used for the preparatory 
measurement contains observables with a nonempty continuous spectrum. In such 
a case it is not possible to suppress the dependence on the original state completely; 
however, it can be minimalized provided the measurement is exact enough. 


7.5.12 Example: Suppose we have a particle in a state Y and measure its mo- 
mentum; if its value is found in a set A C JR®, the state after the measurement is 
described by 

Uh = N(A) Fs 1B(A) Fat, 


where E(A) := ya(Q) and the normalization factor N(A) := ||E(A)F3y||-!. We 
take the ball U.(ko) for A and see how the state ~, behaves in the limit e¢ - 0. 

Let (F3~)(ko) be nonzero; without loss of generality we can assume that it is 
positive. For the sake of simplicity, assume also the vector w belongs to S(JR°). 
Then F37 € S(R*), and therefore (F3w)(k) #0 in U.(ko) for any € small enough; 
this ensures that the normalization factor — which is due to the mean—value theorem 
equal to |(F3~)(ke)|?V., where ke is some point of the ball and V. := 47° is its 
volume — is nonzero. Using once more the assumption 7 € S(JR°), which allows 
us to interchange the integrations, we can express the function wi as 


N a 


Wa (x )= (nye pe , Yvly) es eh (2-9) gh. 


The inner integral can be computed in an elementary way (see also Problem 19) to 
be V.g(elx — y|) e*'(*™ , where g(z) :-= 3z~*(sin z — z cosz). It is easy to see that 
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|g(z)—1| < Cz? for some C >0 and all z € Rt (Problem 21); it follows that the 
limit € > 0 may be interchanged with the outer integral. At the same time, the 
estimate shows that lim,49g(€z) = 1. Moreover, ke + ko as € +0, and since the 
function F327) is continuous, we finally obtain 

lig Ve?) = Goa, (7.17) 
where the convergence is uniform in any bounded set M C JR. Hence independently 
of 7) we obtain the standard plane-wave expression. It does not belong to Let”, 
of course, but the result tells us that by a sufficiently precise measurement of the 
momentum we can achieve that the state is in a chosen (bounded) spatial region M 
approximated arbitrarily closely by the right side of (7.17). Let us remark that the 
smoothness assumption we have made can be weakened substantially (Problem 21). 
On the other hand, the argument cannot be used if (F3~)(ko) = 0 ; this is not 
surprising, however, because this condition corresponds to the heuristic claim that 
“the state |ko) is not contained in the superposition |7)”. 


7.6 The algebraic approach 


Physical theories usually result from unification and generalization of empirical in- 
formation (like every rule, this one has exceptions; compare with the general theory 
of relativity). Only later is the effort made to select a few basic ones among the ini- 
tial facts and to derive the others in a deductive way. Such an activity is, of course, 
meaningful; we need it to make the theory transparent by finding a suitable math- 
ematical language for it, to check its internal consistency and decide whether some 
of the starting assumptions are not irrelevant or lacking an empirical foundation. 

An axiomatic system on which the theory is based must be free from contra- 
dictions; this requirement is common to all theories, which employ mathematical 
methods. For mathematical theories this is the only requirement; on the other 
hand, a physical theory should describe a part of the existing world, so its axioms 
have to be supported by an empirical evidence. It took a long time, in fact, be- 
fore this difference was fully recognized; it took about twenty two centuries for the 
mathematical nature of Euclid’s axioms to become clear. 

Axiomatic systems usually change in the course of time when the development 
of the theory enables some postulates to be replaced by simpler or more general 
ones. We restrict ourselves to the example of quantum mechanics: the postulates 
(Q1)-(Q3) formulated in the preceding sections represent the result of the effort of 
J. von Neumann to unify the two “pre-quantum” mechanics. However, this was not 
the end of the quest for an optimal axiomatic system. One reason is that postulate 
(Q1) has only an indirect empirical justification as we noted in Section 7.1; this 
flaw can be removed by axiomatizing properties of measurements. We shall discuss 
this problem in Chapter 13; now we want to mention another approach to quantum 
theory, which is based on axiomatization of algebraic properties of the observables. 
Some remarks on the history of this idea are given in the notes. 
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Let us look in more detail at the structure of the set of observables for a quan- 
tum system. Some of them have a well-established meaning and they are measured 
by particular experimental techniques; sometimes they are called fundamental ob- 
servables. Examples are coordinates, momenta, energies, charges, etc. In addition, 
the set of observables includes other dynamical variables which have no direct ex- 
perimental meaning but they are functionally related to the fundamental ones. In 
classical mechanics these observables in a broader sense are all functional expressions 
of the type f(q,p). 

In the quantum case the situation is more complicated because not all funda- 
mental observables are mutually compatible. Nevertheless, some functions can be 
defined, including in the first place 


(i) a real multiple Aa of the observable a is measured by the same apparatus which 
is linearly rescaled, 


(ii) the sum of observables a, b, which is understood to be an observable c such 
that (c)g = (a)g + (b)s holds for any state ¢. 


In particular, if the observables a, b are replaced by the families of yes—no experi- 
ments in the way described in Section 7.1, then the mean values are nothing else 
than the probabilities of finding a positive result in such experiments. At the same 
time, we have to stress that the existence of the sum for a pair of observables is 
only assumed: if a, b are noncompatible it is not clear, in general, how to define 
the procedure c, i.e., to construct a suitable device so that the mean values (c)¢ 
would satisfy the above mentioned identity. 

Note that the definition (ii) can also cover some fundamental observables. As an 
example one can take the hydrogen atom, whose Hamiltonian in the center-of-mass 
frame is H := (h?/2m)P?— e?Q-! with Q-} := (Q?+Q2+Q2)-1/. The identity 
(H)4 = (h? /2m)(P?)g—e?(Q-')¢ is valid, of course, provided the mean values make 
sense. The eigenvalues of H are determined, however, by measuring the frequencies 
of the photons coming from transitions between different energy levels; there is no 
direct correspondence to the measurements of the electron position and momentum. 

For simplicity we shall again consider in the following the set ©, of all bounded 
observables of the given system. Prescriptions (i) and (ii) define on O, the algebraic 
structure of a real vector space. We have to equip ©, with a topological structure 
because its dimension is infinite in all practically interesting cases. A natural way 
to do this is to define the norm ||al| := sup, |(a)4| for any observable a € O,, where 
@ runs through all states of the system; this makes ©, a real normed space. The 
algebraic structure can be further enriched; we use the fact that O, also contains 
powers of the observables which can be easily defined: 


(iii) the observable a” is measured by the appropriately rescaled apparatus. 


The operations of summation, multiplication by a real number, and at> a? turn O, 
in the way described in the notes into a real commutative, in general nonassociative, 
algebra on which one can define an involution by a* := a. 
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The algebra ©, can be required to satisfy various physically motivated con- 
ditions; two such systems of axioms are mentioned in the notes. The classes of 
algebras they determine are still too wide, and therefore another assumption is 
added to which no physical foundation has been found up to now: we suppose that 


(ao,) the algebra ©, can be identified with the set of all Hermitean elements of 
some C*-algebra A ; the product a- b of the elements of ©, is related to 
the multiplication in A by a- b= $(ab+ ba). 


For simplicity the algebra A is often also called the algebra of observables of the 
considered system. 

If we adopt this postulate we are able to employ the results of Sections 6.1 and 
6.2. States of the system are then identified with the positive linear functionals 
¢@ on A that satisfy the normalization condition ¢(e) = 1 for the unit (trivial) 
observable e. We denote as Sy the set of all states on A. Since this notion must 
be consistent with the physical concept of state introduced in Sections 7.1 and 7.3, 
one more postulate has to be added: 


(ao,) (a)y = (a) holds for any state ¢ on A, where the left side means the mean 
value of the results of measuring the observable a in the state ¢. 


This also ensures that another definition is consistent: using Theorem 6.2.6 we can 
check that the norm of a as an element of the algebra A coincides with the norm 
introduced above, 


[lal] := sup |¢(a)|. 
oESA 


We know from Section 6.2 that the set S,4 is convex. Its extremal points form 
the set Py, of pure states, while the other states are called mixed; a necessary and 
sufficient condition for a state to be pure is given by Theorem 6.2.7. 

The most important consequence of the postulate (a0.) is that it allows us to 
use Theorem 6.2.6 by which there is an isometric representation of the algebra A 
on some Hilbert space #1. Then we are able to represent (bounded) observables of 
the system by Hermitean operators on . Since A as an operator algebra supports 
other topologies we can also strengthen the postulate (ao,) assuming that 


(ao.,) the set ©, can be identified with the set of all Hermitean elements in some 
W*-algebra AC B(H). 


It is appropriate to return now to the definition of the algebra of observables pre- 
sented in Section 7.4. The arguments presented here show what its meaning is: 
roughly speaking, we start from the family of fundamental observables for the con- 
sidered system, extend it algebraically and topologically, and identify observables in 
a broader sense with all Hermitean elements of the algebra A obtained in this way. 
The choice of a complex algebra is motivated by the requirement of simplicity. 
The postulate (ao,,) provides finer means to classify the elements of S4. We 
are interested primarily in normal states since the other states can have rather 
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pathological properties mentioned at the end of Section 6.4. Concerning normal 
states, we have proven than any such state has a trace representation by means of 
some operator W € J,(H). For the particular class of type J W*-algebras in 
which all minimal projections are one-dimensional we have a stronger result given 
by Theorem 6.4.7. This result acquires a physical meaning if we suppose that H 
decomposes into the orthogonal sum 2c; Ha of coherent subspaces. In general, 
the algebra A then satisfies the inclusion (7.9). If the latter turns into identity, 


A= ¥°B(He), (7.18) 


aceJ 


then to any normal state ¢ on A we can ascribe just one statistical operator 
W € Ji(H), which is reduced by all the coherent subspaces Ha in such a way that 
¢(B) = Tr(WB) holds for any B € A, in particular, for any observable of the given 
system. The state is pure iff the operator W is a one-dimensional projection. 

If we therefore add the assumption (7.18) to the postulates (ao) we fully recover 
the standard formalism discussed in the preceding sections. Observable algebras of 
the type (7.18) are typical for quantum mechanical systems. Recall Example 7.4.5b 
by which the observable algebra for a system of N spinless particles is B(L?(JR?")) , 
but also in more general situations when other degrees of freedom and superselection 
rules are involved, A is still of the form (7.18). 

We must therefore ask what the algebraic approach is good for. We have to 
realize first of all that assumption (7.18) plays a crucial role here; once we abolish 
it we are not able to use the argument which ensures the uniqueness of the trace 
representation, i.e., of the operator W in Theorem 6.4.7. At the same time, the 
assumption need not be valid when systems with an infinite number of degrees of 
freedom are considered; for example, it is known that any algebra of local observables 
of a free quantum field is of type [JI (see the notes). 

There is a deeper reason, however. The abstract algebra A can have different 
nonequivalent representations corresponding to different physical situations. For in- 
stance, quantum fields differing by mass or interaction can correspond to nonequiv- 
alent representations of the same algebra of observables — cf. Section 12.3. In such 
a case the algebraic description of the observables and states has to regarded as pri- 
mary because it deals with the properties of the system which are representation— 
independent. Note that in order to decide whether A can have nonequivalent 
representations it is not necessary to investigate all its elements; it is sufficient to 
restrict our attention to the fundamental observables that generate it; an example 
of fundamental importance will be given in the next chapter. 


Notes to Chapter 7 


Section 7.1 As mentioned in the preface, the contents of this and the following chapters 
are not intended as a substitute for a course in quantum theory. There are numerous 
textbooks which the reader can consult for the physical material discussed here: as a 
sample let us mention [Bo], [Dav], [Dir], [LL], [Mes] for quantum mechanics; [ BD], 
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[BS], [IZ], [Schwe] for quantum field theory, in particular, [IZ], [Hua 2], [SF] 
for the theory of non-Abelian gauge fields and its applications in physics of elementary 
particles; [Fey], [Hua 1] for quantum statistical physics, and many others. At the same 
time, applications of quantum theory in different parts of physics, and also chemistry, 
biology, etc., are nowadays so plentiful and rapidly multiplying that it is meaningless even 
to attempt to compile a representative list of references. 


On the other hand, there is an extensive literature devoted to mathematical aspects 
of quantum theory. The pioneering role was played by the classical monographs [vN] and 
[Sto]; following them many authors have analyzed the basic concepts of the theory, their 
properties, relations, and generalizations — see, e.g., | BeS]; [Da 1], Chaps.2-4; [ Ja}; 
[Jor]; [Ma 1,2]; [Pir]; [Pru]; [Ri 1]; [Var], and others. As we have also mentioned, the 
main interest in rigorous quantum theory has shifted gradually from general problems to 
analysis of particular systems — this is the main topic of monographs [RS 2-4], [Sche], 
[Si 1], {Thi 3], and many others; some of them will be mentioned at the appropriate 
places below. Rigorous methods of quantum field theory are discussed, e.g., in [ BLOT ], 
[Em], [Sv], [SW], and also in monographs [GJ], [Sei], [Si 2] and others which 
concentrate on the so-called constructive approach. Mathematical aspects of quantum 
statistical physics are treated, for instance, in [BR], [Em], [GJ], [Sin]. 

The facts from probability theory that we shall need are contained in standard text- 
books, e.g., [ Fel], { Par], [Sir]. The “operational” definitions discussed here can be 
found, e.g., in [Ja], Chap.6. Notice that in reality one has to associate measuring devices 
only with some important observables; we return to this problem in Sec.7.4. 

The mentioned consequence of postulate (Q3) for a pair of identical yes—-no experi- 
ments performed immediately one after the other concerns situations when the outcome 
of the experiment characterizes the state after the measurement. An example is the regis- 
tration of a particle by a Geiger counter: if the apparatus clicks we know that the particle 
has been found in the sensitive volume of the detector. We usually refer to such situations 
as measurements of the first kind. There are also second-kind measurements, where the 
measured values refer to the state before the experiment has been performed; recall, for 
instance, measuring the excited-level energies of an atom by registration of the photon 
frequencies coming from their deexcitation. First-kind measurements are simpler and we 
shall deal mostly with them in the following; their distinctive feature is that they can serve 
as preparatory measurements — see Sec.7.5. 


The argument following Example 3 should not be interpreted as a claim that the 
exact values of physical quantities are never of importance. In the classical theory of 
dynamical systems and its quantum counterpart, e.g., the systems exhibit a different 
behavior depending on whether certain parameters (such as ratios of driving frequencies, 
the sizes of the region in question, etc.) are rational or irrational (even the kind of 
the irrationality is important) — see, e.g. [CG 1], [Com 1], [MS 1], [JL 1], or 
[Seb 1]. Nevertheless, one cannot decide the value of the parameter experimentally: if 
we approximate an irrational by a suitable rational number the system will exhibit the 
behavior characteristic of the irrational value for some time (or in some interval of energy; 
the longer the interval the better the approximation is), but eventually it comes out that 
the value is rational after all. 


Section 7.2 The “eigenvectors” corresponding to the continuous spectrum used in quan- 
tum mechanical textbooks do not, of course, belong to the state space but this does not 
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mean they are useless. In fact, many of the formal considerations involving these eigen- 
functions can be made rigorous either in the so-called rigged Hilbert space framework — 
see, e.g., [EG] and references therein — or even within the standard Hilbert-space theory 
by the eigenfunction expansion method, which in a sense represents a generalization of the 
Fourier transformation — cf. [ AJS], Chap.10; [RS 3], Sec.X1.6. 

Example 2 should be regarded as a warning against the dangers which may await 
you if instead of proving a given observable to be (essentially) self-adjoint you merely 
check formally that it is “Hermitean”, i.e., symmetric. At the same time, the employed 
derivation of a family of Hamiltonians based on the theory of self-adjoint extensions 
(see also Problems 2-4) gives a glimpse of a powerful method of constructing various 
solvable models of quantum systems; we shall return to this in the last two chapters. The 
operator J of Example 4.8.5 also has physically meaningful non—selfadjoint extensions 
but their interpretation within the standard quantum mechanical formalism requires a 
longer explanation — cf. [Ex], Sec.4.3. 

According to Problem 5, S(JR*) is a core for the momentum-—component operators 
P; . Their domains are by definition D(P;) := Fy +D(Q;), but we can also describe them 
explicitly as the subspaces consisting of those ~ € L?(JR°) for which the right side of 
(7.5) makes sense as a distribution and belongs to L?(JR°). In the same way one can 
specify the domains of more complicated partial differential operators through so-called 
Sobolev spaces. The proofs are based on properties of the Fourier transformation which 
go far beyond Example 1.5.6; we refer, e.g., to [ Ad] or [RS 2], Chap.IX. The double 
commutator estimate used to prove the relation (7.6) is due to A. Jaffe — see [Si 2]. 


Section 7.3 The concept of a mixed state was introduced to quantum theory by L. Lan- 
dau and J. von Neumann. The state described by the vector @ = }>, ced, is often called 
a coherent superposition of the states ®1,®2,... in order to distinguish it from the non- 
coherent “superposition” given by the density matrix W = 5°, we, ; in the second case 
we prefer to speak about a mizture of the considered states. 

Given a probability measure ws on (JR, B), we can define its moments m,(S) := 
Spa*dws(x), k =1,2,..., provided the integrals exist. The moment m(S) is called the 
mean value (S) of the random variable S. Furthermore, using the first two moments one 
defines the standard (or mean-square) deviation as AS := \/m2(S) — m,(S)°*. It is easy 
to see that (AS)? = f(x — (S))?dws(x) ; this quantity is called dispersion. 


Section 7.4 The first example of a superselection rule was found by G. Wick, A. Wight- 
man and E. Wigner who deduced from the transformation properties of wave functions 
that the states with integer and half-integer spins belong to different coherent subspaces 
— see | WWW 1] and also [ Wig], Chap.24. In the same paper they also conjectured 
that the electric charge and baryon number define superselection rules; the condition un- 
der which a mixed state is realizable was formulated in [WWW 2]. There is also an 
example of a “continuous” superselection rule: in the nonrelativistic quantum mechanics 
the requirement of Galilei covariance implies that states with different masses belong to 
different coherent subspaces; we shall return to this problem in Remark 10.3.2. 

The assumption about finiteness of the energy mean values for realizable states can 
be found in renown texts such as [SW], Sec.1.1; other authors require even the realizable 
pure states to belong to the domain of the Hamiltonian — see [ BLT ], Sec.2.1.3. The 
fact that such assumptions avoid any experimental verification was discussed in [HE 1], 
[Ex 1], see also [Ex], Sec.1.6. The argument presented at the end of the section applies 
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to any bounded observables A,. In the notation of Section 7.6 this can be expressed as 
\b( An) — bo(Ax)| < €%, where ¢, do are the states described by the statistical operators 
W and Wo, respectively, which means that in an actual experiment we do not determine 
a point in the set S,4 C A* of states but rather some *-weak neighborhood. 


Section 7.5 The inequality (7.16) of Example 8 is valid only for n = 1, 2,3, since (1+h)7! 
does not belong to L?(R”) for n > 4, so we cannot use the factorization trick. On the 
other hand, D(Ho) c L7(R") in this case holds for any g € [2, 2n/(n—4)) , and moreover, 
for any a > 0 there is a b such that ||wll, < a|| Holle + d||v|le for all » € D(Ho) — 
see, e.g., [RS 2], Sec.IX.7. In the terminology mentioned above, the domain of Ho is 
the Sobolev space H®?(UIR"), which consists of all w € L°(IR") for which —Aw makes 
sense as a distribution and belongs to L?(JR”) ; results of the type (7.16) then represent 
examples of so-called embedding theorems telling us which ZL? spaces are contained in a 
given Sobolev space. 

The claim contained in Theorem 9 was formulated for the first time by P. Dirac — see 
{ Dir], Sec.III.4. His argument is valid, however, only for the observables with pure point 
spectra; this is why it is sometimes called Dirac conjecture. The proof for the case when 
S consists of bounded observables was given by [Mau 1]; cf. also | Mau], Sec. VIII.5. 

It is not the full truth, of course, that we know nothing about the state of the system 
before the “first” measurement. We certainly have a definite enough idea about the result 
of the preparatory measurement coming from the theoretical considerations, which guided 
us during construction of the “source”, calibration measurements, etc. This all results 
from a development in which theory mingles with experiment; trying to make a sharp 
distinction would mean producing a new version of the old “chicken or egg” question. 


Section 7.6 The first step towards the axiomatization of properties of quantum systems 
was made by P. Jordan, J. von Neumann and E. Wigner in 1934 — see { JNW 1]. They 
started from the observation that the prescriptions (i) and (ii) define on the set ©, the 
structure of a real vector space and (iii) can be used to define the symmetrized product, 
a-b:= 3((a+b)?—a?— b*) . In addition, the algebra Op is required to satisfy the following 
conditions ( JNW-azioms): 


(j1) af +---+a2 =0 implies aj =--- =a, =0, 
(j2) Zag 
(j3) (at+b)-c=a-c+b-c. 


The structure determined by these postulates is called a Jordan algebra. The mentioned 
authors assumed in addition that ©, has a finite dimension; this allowed them to prove 
the existence of a spectral decomposition for any observable. There is also a classification 
of Jordan algebras; more details and further references can be found in [ Em], Sec.1.2.3. 

The assumption of a finite dimensionality which the authors of [JNW 1] made to avoid 
introducing a topological structure is, of course, too restrictive. Introducing topology by 
a norm, I. Segal in 1947 formulated the following set of requirements (Segal azioms): 


(sl) Op, is a real Banach space with a norm ||- |j, 


(s2) Oy is equipped with the unit element e and the operation a+ a” ; the polynomials 
in the variable a obey the standard algebraic rules, 
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(s3) the map ar a is norm continuous, 
(s4) ||a?— b?|| < max{||a”||, ||b?|]}, 
(s5) |la?|| = |lall?. 


The original paper [Seg 2] also contained the requirement || ace a7|| < || Nacs a?|| for 
any finite subsets R C S in ©, ; only later was it found that this followed from (s1) and 
(s5). The object satisfying these requirements is called a Segal algebra. 

One can define on a Segal algebra the symmetrized product by the prescription men- 
tioned above. In general, it is neither distributive nor associative. If we require it to be 
distributive, then the Segal algebra simultaneously satisfies the JNW-axioms. For asso- 
ciative Segal algebras we have a result analogous to Theorem 6.1.5: any such algebra is 
isometrically isomorphic to the algebra of continuous functions on some compact Haus- 
dorff space — cf. [Seg 2], and also [Em], Thm.1.9. It is clear from this that for quantum 
theory nonasociative Segal algebras are interesting in the first place. 

Let A be some C’*-algebra; then we can easily check that Ag :={aeA: a* =a} 
fulfils the axioms (s1)—(s5); the symmetrized product a:b := $(ab + ba) is distributive. 
A Segal algebra is called special if it is isometrically isomorphic to the set of Hermitean 
elements of some C*~algebra; in a similar way, one defines special real Segal algebras. 
Other Segal algebras are called exceptional; such algebras exist. No “internal” criterion 
is known, which would allow us to decide whether a given Segal algebra is special or 
exceptional; for more details and references see [| Em], Chap.I. 

If we adopt the postulate (ao,), then normal states on A can be represented by 
statistical operators W € Ji(H). This is why the subspace 7? C Ji(H) consisting of 
those W which are reduced by all coherent subspaces is sometimes used as a state space 
— see, e.g., [Da 1], Sec.1.4. It is complete with respect to || - ||; and the states on it are 
represented by the positive elements with the unit trace norm. To distinguish it from the 
state space introduced in Section 7.1, one usually speak about state Banach space. 

The proof that any algebra of the so-called local observables of a free quantum field 
is a type III factor can be found in {[ Ara 1] (see also the notes to Sec.6.3.); the algebraic 
formulation of quantum field theory will be mentioned again in Sec.13.3. C*-algebras 
describing systems with an infinite number of degrees of freedom usually have uncountably 
many nonequivalent irreducible representations of which we actually use a small part 
only, sometimes a single one, and on other occasions a countable family corresponding to 
particular superselection rules. This is made possible by the fact mentioned in the notes to 
Sec.7.4 that an experimental determination of a state yields some *-weak neighborhood 
in A* > S,; at the same time we know from [Fel 1] that pure states corresponding to a 
single irreducible representation are *-weakly dense in S4 — a more detailed discussion 
of this problem can be found in [Haa 1] or in the appendix to [BLT]. A simple example 
of a problem, which involves nonequivalent representations of the algebra of observables, 
is represented by the van Hove model — see [Em], Sec.1.5. 


Problems 


1. Check the spectral decomposition of the operators S; of Example 7.1.2. 
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2. Consider a particle on halfline from Example 7.2.2. 


(a) Let c <0; then the operator Ho,c} {¢c}* is positive. 
(b) The reflection amplitude satisfies |R| = 1. For which values of c is the phase 
shift of the reflected wave independent of energy? 


3. Consider a particle whose motion is confined to a bounded interval J = a, }] of the 
real axis. 


(a) Under which condition can we define on L*(J) self-adjoint operators of mo- 
mentum and kinetic energy corresponding to the formal expressions —i d/dz 
and —d?/dz , respectively? 


(b) Find the spectra of these operators. 


(c) When does the energy equal the square of some momentum operator? 
Hint: Use Example 4.2.5 and Problem 4.61. 


4. Consider the operator H on L?(JR) corresponding to the expression (4.8) with the 
potential V(x) = gr. 


(a) Prove that H is e.s.a. (in fact, self-adjoint) if g > : , and it decomposes into 
an orthogonal sum of operators acting on L?(IR*), respectively. 


(b) Find the self-adjoint extensions of H in the case g € (-1, $) : 


Hint: (a) Find solutions to (H*+%)y~ =0. (b) Cf. [DE 1]. 
5. Check that the operators P;} S(IR%) of Example 7.2.3 are e.s.a. 
Hint: Use the unitary equivalence (7.4). 


6. Let H bea separable Hilbert space with an orthonormale basis {¢n}°2,. Consider 
the operator a defined by a(Son Cngn) = pn VN CnOn-1 with the domain D(a) := 


{Y= Xn tnon : Canlen|? < oo}. 
(a) Find the adjoint a*. 
(b) Check that D(a*a) = D(aa*) and [a,a*|}w = holds for any w € D(a*a). 


7. Prove that the harmonic-oscillator Hamiltonian of Example 7.2.4 can be expressed 


as H = 2a*a+I, where a := 2-'/2(Q4iP). Use this result to check that H is 
self-adjoint and find its spectrum. 


8. Let V be a bounded measurable real-valued function on R such that the limits 


Va := limz—+oo Vi(x) exist; then the operator H := P?+V(Q) satisfies o(H) D 
[v,oo) where v := min{V,, V_}. 


9. Consider the operator P? of Example 7.2.1. Prove that its resolvent (P?+2)-! is 
for Rex > 0 an integral uperator with the kernel G,(z, y) := * eW la) , 


10. Any statistical operator W satisfies Tr W? < 1 ; the relation turns into identity iff 
the state W is pure. If dimH =n, we simultaneously have Tr W? > i, 
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11. Let W bea statistical operator and FE a projection such that Tr(EW) #0; then 


(a) W':=(Tr(EWE))-!EWE is a statistical operator, 


(b) if W is a one-dimensional projection determined by a unit vector ¢, then W’ 
is also a one-dimensional projection and it corresponds to the vector @¢/ := 
E¢/\|EZ¢ll, 

(c) let { Bj: j =1,...,N}, N < 00, bea family of orthogonal projections such 
that Tr(EW) #0 holds for E := \, E; ; then (Tr(EW))-! DX, EWE; 
is @ statistical operator. 


12. Suppose that the system is in a state W and we successively measure the observ- 

"ables Aj,..., An. Prove that w(An, An;.-.;A1,41;W) = TrW,, where W, := 
E,...-E,\WE,...E, and E; := E,,(4;), and that the state after such a measure- 
ment is described by the statistical operator W’ := (Tr W,,)~!W,. In particular, if 
Aj =::: = An =: A, we have w(An,...,A1,A;W) =Tr {E4 (M1 As) w} : 


13. The following definition of compatibility is sometimes used for yes—no experiments 
(which we do not regard here as filters): we perform three measurements ordered 
as E, E2E) ; if the second measurement £ yields with certainty the same result as 
the first one, then EF, and Ez are compatible. Show that this occurs iff EF, and 
E2 are compatible according to the definition from Sec.7.5. 


14. The yes~no experiments E},..., Ep are disjoint if a positive result in some of them 
excludes a positive result in another. 


(a) Prove that this is true iff the projections /),..., EH, are orthogonal; it implies 
their compatibility. 

(b) Specify the conclusions of Example 7.5.5 to the case when Fj,..., Ey, are 
disjoint and extend them to the case of an infinite family of disjoint yes—no 
experiments. 


15. Prove the properties of the spin component operators (7.11) mentioned in Examp- 
le 7.5.4b. Show that the set {51, S2,S3} is irreducible in €25+1. 


16. The operator —A on L?(JR”) is e.s.a. on S(JR”) as well as on C§°(UR”). 
Hint: Check that —A}CS°(IR”) > —A} SUR”) and use (7.4). 


17. Consider the position and momentum operators Q;, P, on L?(IR”) of Example 7.2.3 
and denote Q := {Qi,...-,Qn}, P:= {Pi,..-, Pa}. Let F: R" —C be a Borel 
function. Prove 


(a) f(P) = Fr'f(Q)Fn, in particular, the projection—-valued measures corre- 
sponding to the operator sets P and @ are unitarily equivalent, 


(b) (f(P))(x) = (20)-”/? fan (Fn 'f)(@—y)b(y) dy holds for all pb € D(f(P)) 
provided f € L*(R”). 


Hint: Use (7.4) and Example 5.5.1. 
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18. 


19. 


20. 


21. 
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Extend the result of Problem 9 to higher dimensions: put m = 1/2 in Example 7.5.8 
and use the previous problem to prove that the resolvent (Ho+k7)~' is for Rex > 0 
an integral operator with the kernel 


(a) Gas, y) == i HS (injc—y}) if n =2, where Hi) is the Hankel function, 


—n|>-y| 


(b) Gx(2,9) = Ger n=, 


1-n/2 
(c) more generally, Gx (x,y) := (2m)-"/? (=) - K(nj2)-1(K|z—y|) for n 22, 
where K, is the modified Bessel function. 


Let P,, P2,P3 and Ho be the operators of momentum components and the kinetic 
energy, respectively. Find the probabilities w(A,P;w) and w(A, Ho; 7). 

Hint: Using Problem 18, show that Ex,[0, A) is an integral operator with the kernel 
Ky(a,y) := (2n?)-!(2md)?/22-3(sin z — z cosz) , where z:= V/2mA|zr—y] . 


Let H be separable. For any commutative set S C Lsa(H) one can find a Hermitean 
operator A € B(H) such that SU {A} is a CSCO. 
Hint: Use Corollary 6.5.6. 


Prove the estimate |g(z)—1| < Cz? used in Example 7.5.12. Show that the relation 
(7.17) remains valid under the following weakened assumptions: 


(i) pe PUR*)NL(R®) and fps ly v(y)|dy < 0, 


(ii) we replace the balls by a one-parameter family {U. : « > 0} of neighborhoods 
of the point ko such that diamU, = 2¢ and limsup,_,9 €9(vol(U-))~! < oc, 
where vol(U-) := fy, dk is the Lebesgue measure of U-. 


Chapter 8 


Position and momentum 


8.1 Uncertainty relations 


We know that the outcome of measuring an observable A is the probability measure 
w(-, A;W) provided the system is ina state W. This represents a lot of information, 
and we therefore often use several simpler quantities derived from the measure to 
characterize the result. Most suitable for this purpose are moments of w(-,A;W) or 
their combinations; we have to keep in mind that they are not directly measurable, 
as was pointed out in Section 7.4. The simplest among them is the mean value given 
by (7.7). It tells us nothing, however, of how much the results of the measurement 
are spread. This can be done, e.g., by means of the standard deviation; recall that 
it is defined by 


(AA)w = ((A?)w — (A)2)*? = (f- Arya OW) ) 


The standard deviation represents a way of gauging how exact the measurement is. 
Notice first that for a single observable the precision is in principle unrestricted. 


8.1.1 Proposition: Let 4 € o(A) ; then 


(a) for any € > 0 there is a pure state represented by a vector ~. such that 
|A— (A)y,.| Se and (AA)y, < 2e, 


(b) the identity (AA)w = 0 holds iff Ao := (A)w is an eigenvalue of A and W = 
E,({Ao})W Ea({Ao}). In particular, for a pure state ~ we have (AA), =0 
iff w is an eigenvector of A corresponding to the eigenvalue (A), . 


Proof: By Proposition 5.4.1a, we can choose a unit vector #~- € Ran E4(A-), where 
Az := (A—e,A+€). Without loss of generality we may assume that . describes a 
realizable state; otherwise we choose a coherent subspace H. such that F,w, #0 
and set wi := Eqy-/||EaWel|. The projection E4(A.) as an observable commutes 
with E, so wl € RanE,(A.). The mean value can be expressed by (7.7) as 
(A)ye = IS Ed(we, EL ye) ; it easily yields the estimate A-e < (A)y, < Ate, ie., 
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the first one of the above inequalities. This in turn implies | — (A)y.| < |E—Al+e, 
and therefore 


(AAR, = [CE (Adaa)dle, BE Ye) < 4e?. 


Concerning part (b), the sufficient condition is easy. On the other hand, suppose that 
(AA)w = 0; then the definition of the standard deviation gives wIR\{Ao}, A; W) = 
0, z.e., w({Ao}, A;W) = 1. Denoting E := E,({Ao}) and E’ := I—E we can 
rewrite these relations as Tr(EW) =1 and Tr(E’W) =0, respectively. Using the 
spectral decomposition W = 0, wjE;, Ej :-= Eg, , we get do; w;||E¢;||? = 1 which 
requires ||E@;|| = 1, or in other words, FE; = E; for those 7 for which w, is 
nonzero in view of the normalization condition TrW = 1. It follows that EW = 
WE = EWE. The operator E’WE’ is positive, so the condition Tr(E’W) = 0 
means E’/WE’ =0; together we get EWE=E. Of 


Moreover, this result extends easily to the case when a family of compatible ob- 
servables is measured (Problem 1). On the other hand, noncompatibility means not 
only that the order in which the experiments are performed is important, but also 
that precision is limited when we measure noncompatible observables on identical 
copies of the system, i.e., in the same state. We shall use the standard abbreviation 
for the commutator of two operators, [A1, Ag] := A;A2 — AeA ; recall that due to 
Example 5.5.9 the commutator is zero if A,, Az are commuting self-adjoint opera- 
tors, while the opposite implication is not valid — see Example 8.2.1 below. Then 
the basic result can be formulated as follows. 


8.1.2 Theorem (uncertainty relations): Suppose we measure observables A), Ao 
on the system in a state W. If the operators A;W, A;A,xW belong to the trace 
class for 7,k = 1,2, then the standard deviations satisfy the inequality 


(AAy)w(MAa)w > 5 ITr (é[Ay, Ag]W)| . 


Proof: Without loss of generality we may assume (A;)w = 0 ; otherwise we take 
the operators Aj := A; — (Aj)w , which satisfy the same assumptions and 


(AA;)iy = (AAj)iv = ((A5)?)w, [Ai Ao]W = @[A4, AL]W. 
By assumption, any vector ¢ € RanW belongs to D([Aj, A2|) = D(A1) MN D( Ag) 
and A;¢ € D(Ax) for j,k = 1,2. Then for any real a we have the inequality 
0 < |\(Ai+taA2) dll? = (¢, A?) + a(¢, i[A1, Aa]o) + a?(¢, A3¢) , and therefore also 
Tr (A?W) + aTr (i[Aj, Ao]W) + a? Tr (A2W) > 0; 


this yields the result because Tr (A*W) = (AA,)?,. 


8.1. UNCERTAINTY RELATIONS 295 


8.1.3 Remarks: (a) Operator C := i{A,,A2] need not be densely defined; the 
theorem requires only RanW c D(C). If it is densely defined, it is sym- 
metric by Proposition 4.1.2; however, it can have no self-adjoint extensions 
(Problem 3). If there is a self-adjoint C > C, then the assumed boundedness 
of CW implies CW = CW and the inequality can be rewritten as 


(AAr)w(AAsbw > 5 |(Chw 


(b) If W := Ey is a pure state, then operator AEy belongs the trace class iff 
w € D(A). Hence if ~ € D(A;A,) for j,k = 1,2, the standard deviations 
(AA;)y = (|A;vll? — (b, Aj)?)!” satisfy the relation 


(AAry(AAa)y > 5 ICs ilAr, Addy) - 


As mentioned above, the theorem imposes no restriction if the observables 
A,, Ag are compatible. In other cases the right side may be nontrivial and de- 
pendent, in general, on the state W. 


8.1.4 Example (spin components): Operators S; representing the spin components 
satisfy commutation relations (7.12), so we have 


(ASi)w(AS2)w > 5 |(Ss)wl 


and the similar relations obtained by cyclic permutations of the indices. If W is a 
pure state, for instance, described by an eigenvector of 5; or S2 we have zero on 
the left side , and therefore (S3)w = 0. On the other hand, in the case W := E,,, 
the right side equals 3|m|. 


However, the best-known application of Theorem 2 deals with the position and 
momentum operators. 


8.1.5 Examples: (a) Heisenberg relations: Consider operators Q, P of Exam- 
ple 7.2.1. The operator C' := i[P,Q] is densely defined because its domain 
contains, e.g., the set S(JR), and we can easily check that it is a restriction 
of the unit operator to D(C) := D(PQ)N D(QP) ; hence 


ih 


(AP)y(AQ)y = 5 


holds for all y € D(C) M D(P?) N D(Q?). The analogous relation is valid for 
any mixed state which satisfies the assumptions of the theorem. 


(b) n-dimensional Heisenberg relations: Consider next the operators Q;, Py on 
L?(JR"). Due to Proposition 5.7.3, they commute for 7 # k. On the other 
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hand, in the case 7 = k we can reason as in the preceding example; together 
we get the inequalities 


(AP w(AQy = 55m, 


which are valid if ~ € D(Q5 Pz) 1 D(PiQ5) , where r,s are any non-negative 
integers fulfilling r+s < 2, and the analogous relation for the mixed states 
which satisfy the assumptions of the theorem. We can also introduce the global 
quantities, 


(AQ)iv == (S@, <a (Qs)w)? = 57(AQ;)% 
Ww 


j=1 j=1 


and (AP)%, defined in the same way, which characterize the uncertainty at 
the position and momentum measurement, respectively (Problem 4); under 
the stated assumptions, the Hélder inequality then implies 


(AP)w(AQ)w > 5. 


The position and momentum coordinate operators have purely continuous spec- 
tra, so one can never assign an exact value to them in an experiment. Due to 
Proposition 1, of course, one can measure them with any desired accuracy, but the 
results of the above example tell us this cannot be done simultaneously in the same 
state. It also has consequences for a successive measurement of the two observables 
on the same copy of the system. If, for instance, we perform a high-precision mo- 
mentum measurement on a particle, by Example 7.5.12 the resulting state is well 
approximated by the plane-wave expression in a large spatial region, and therefore 
the probability density of finding the particle there is approximately uniform. Hence 
if a position measurement is performed immediately afterwards, the standard devi- 
ation of the results is large; the larger the standard deviation, the more precise the 
original momentum measurement was. 

Another conclusion drawn from uncertainty relations is where the borderline 
between the classical and quantum mechanics should be placed. Recall that in 
standard units they are of the form 


(APaw(AQs)w > Fb and (AP)w(AQ)w > 2 

respectively. Quantum effects connected with the nonzero value of the right sides 
may be disregarded if the left sides, or more generally all quantities of the dimension 
of action appearing in the description of the given system are large in comparison to 
the Planck constant; in such a case the classical approach is expected to be adequate. 

The reader should be warned that even if the commutator of a pair of observables 
is a restriction of the unit operator, a formal application of the uncertainty relations 
can lead to an erroneous conclusion. 
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8.1.6 Example: When we separate variables in a spherically symmetric problem 
(see Section 11.5 below) we work with the operators on L*(0,2m) representing the 
azimuthal angle and the canonically conjugate momentum (the third component 
of angular momentum). These are defined by Q.: (Qaf)(y) = yf(y) and P, : 
(Paf)(~) = —if’(y) with the domain D(P,) := { f € AC(0,2m): f(0) = f(27)}. 
We know from Examples 4.2.5 and 4.3.3 that both of them are self-adjoint; Qa 
is bounded with o(Q.) = [0,27] while P, has a pure point spectrum, o(P,) = 
{0,+1,+2,...}. By Theorem 2, 


(AP.)(AQa)s > 5 


womistor f = D2) 0 D(P;Q,) = { f € AC*(0, 2x) : f) = fx) =0, 70) = 
f'(2r) }. The danger of formal manipulations is obvious. If we choose f := fm, 
where f(y) := (27)~/*e*"” is an eigenvector of P,, then the left side is zero. 
This is no paradox, of course, because fm ¢ D(PiQa). 


The states for which the inequality of Theorem 2 turns into identity are called 
minimum-uncertainty states. For simplicity, we restrict ourselves to the pure states 
(cf. Problem 5) described by vectors ~ € D(A;Ax), j,k = 1,2 ; we suppose that 
(AA)y(AA2)y = $|(b,C)|, where again C := i[A;, Ao]. This means that the 
quadratic polynomial used in the proof of Theorem 2 has a double root, 


I, Cv)| _ _ (AAr)y 


~ OAS? (aang? 
which satisfies 
(A; — (A1)y + ta(A2 — (A2)y)) y = 0. 
Hence if we choose the mean values and the ratio of the standard deviations, ~ can 
be found as a solution to the last equation. 


8.1.7 Example: Suppose that (AP),(AQ)y = } holds for the operators Q, P on 
L? (JR), and denote p:= (P)y, q:= (Q)y and Aq := (AQ),,. The above argument 
yields a first-order differential equation, which is solved by 


aie). ; 
WWE) = prea ow { GEE + ine - Soa} (8.1) 


(Problem 6a), which is easily seen to belong to D(P"Q™) for n,m = 0,1,2. The 
last term in the exponent corresponds to the integration constant and has no mean- 
ing for determination of the state. Suppose now for simplicity that Ag = 27'/?, i.e., 
(AP)y = (AQ)y = 27'/? ; this can always be achieved by an appropriate choice of 
units. The vectors (8.1) then become 


Yap : Yap(t) = no eS ie 1/2 (8.2) 


which is nothing else than 7, of Remark 2.2.8 provided we put w := $(q—ip) 
(see Problems 6b and 2.12). Hence we may conclude that the minimum uncertainty 
conditions define a family of coherent states in L?(JR). 
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The inequalities of Example 5, sometimes also called the uncertainty princt- 
ple, are probably the most widely known restriction to the results of position and 
momentum measurement, but they are by no means the only ones. We shall de- 
scribe two more, and others will be mentioned in the notes. We again consider the 
operators Q;, P, on L?(JR") and define 


n a/@ n 8/2 
gris (Sas) , (ea 
j=l =1 


for any real a, @ ; as real functions of self-adjoint commuting operators these ope- 
rators are self-adjoint. With this notation, we have the following lower bound. 


8.1.8 Theorem (Bargmann): Let n >3 and up > —2; then 


lori 
]Qethy||? 


holds for any nonzero = € D(P?)N D(Q#t?) . 

Proof: First consider ~ € S(JR") such that (0) = 0 for k = 0,1,2,.... We 
denote r := |x|; then |r~'z,| < 1 gives (P®)y > WiawlO"Q, PO? . ‘The right 
side can be estimated the Schwarz inequality, 


é h |(Q@O,.20j0")". 2 (me ate) 
(Py 22 Tori? owl 


(P%)y > F(n+ ni)? (8.3) 


provided Q#4t+!y #4 0. To express the numerator of the last fraction we use the 
self-adjointness of the operators contained in it; we have Q;Q**!y € S(JR") due 
to the assumption, so (7.5) gives 


=I (Q'Q;Pid, Q"*Y) = 5 (ds (G?+ uQNQ**Y) 5 


substituting this into the last inequality and summing over j, we get (8.3). 
Next we take any ~ € S(UR") different from zero. The norm in the denominator 
then makes sense in view of the inequality 


Qnr/2 


1 
Oe, 1, || 2 > 2 2a+n—-1 2a 2 
Howl? > Tray cna frertdns fae) P ae, 


where the first integral on the right side converges for 2a+n > 0 ; choosing a suitable 
approximating sequence of functions with vanishing derivatives at the origin, we can 
check that (8.3) is valid again (Problem 7). Finally, SR") is a common core for 
the closed operators @*, a > —n/2, and P? (see Problem 7.16), so to any 
py € D(P?)7 D(Q**") we can find an approximating sequence {Yn} C S UR") such 
that P?y), + Py and Q%, + Q%). Wy 
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8.1.9 Remarks: (a) Inequality (8.3) also holds for n = 2 and pp > —2 (see the 
notes). Under appropriately stronger assumptions the norms on the right side 
can be written as mean values (Problem 8d). We can also notice that the 
commutation relations on which the proof is based do not change when either 
of the operators Q;, Py is “shifted” on a multiple of the unit operator. In this 
way, €.g., the left side may be replaced by (AP)2, and the norms on the right 
side by (AQ#/)4, and (AQ#+)2 , respectively — see the following remark. 
The form (8.3) is, however, more suitable if we use the inequality to get a 
lower bound on the kinetic energy. 


(b) In the case » = 0 we recover the uncertainty relations. For 4 = —1,—2 we 
‘get two other frequently used inequalities 
(n — 


(Py > Bo Tiong, cpyy> Sige. 6) 


The inequalities (8.3) tell us that the probability measure w(-,Q;w) cannot be 
too concentrated around some point unless the mean value (P?),, is infinite. This 
can be expressed in a more illustrative way. 


8.1.10 Corollary (a local uncertainty principle): Let n > 3 and w € D(P?) ; then 
w(Us(a),Q;¥) < Cre (AP), (8.5) 


where C,, := 4(n —2)~? and U,(a) := {x € IR": |x—a| < 6}, holds for any 6 >0 
and ae ik”. 

Proof: As we have remarked, one can consider the case with (P;)y =0 and a=0 
only. Choosing » = —2 in the theorem we get the second of the inequalities (8.4) 
for any ~ € D(P?). Furthermore, we denote A := U;(0) ; then the simple estimate 
[6-1 yva(x)| <r7? gives 


IS *xaQWi? < ea ag Py, 


i.e., inequality (8.5). 


8.2 The canonical commutation relations 


The fundamental role in nonrelativistic quantum mechanics is played by the com- 
mutation relations between the operators describing position and momentum, which 
can be formally written as [P,Q] = —7J , or more generally as 


[Pe Qj] = —t6 541 (8.6) 


for systems whose configuration space is IR” (one usually refers to the dimen- 
sion of the configuration space as the number of degrees of freedom — see also 
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Remark 11.1.1a below). We have already employed the identities, (8.6) which are 
called canonical commutation relations (the shorthand CCR is often used), 
several times: to check irreducibility of the set {Q,P}, to prove the uncertainty 
principle, etc. Let us now discuss them more thoroughly. 

The basic problem is the existence and uniqueness of their representation; we 
want to know what the operators satisfying (8.6) may look like. To begin with, 
notice that the relations (8.6) are indeed formal; there are no bounded operators 
which would fulfil them (Problem 10), so they make no sense as operator identities. 
This flaw can be corrected if we replace the original operators by suitable bounded 
functions of them. Recall that self-adjoint operators A,, Ag commute iff the same 
is true for their resolvents or the corresponding unitary groups — see Problem 5.23b 
and Corollary 5.9.4. 

We also know from Example 5.5.9 that if [A1, Az] 40 holds for some 7% in the 
domain of the commutator, then operators A,, Ag do not commute. On the other 
hand, caution is needed if we want to use the fact that the commutator vanishes 
to conclude that the operators commute. To illustrate the hidden danger, suppose 
that the self-adjoint operators A;, Ao are such that 


(a) there is a common dense invariant subspace D, i.e., D =H and A;D Cc D 
for’ 7 = 1,2, 

(b) D is acore for A,, 

(c) A,Aow = A2Aiw holds for any ~ € D. 


Contrary to the natural expectation, these conditions are not sufficient for the com- 
mutativity of A,; and A». 


8.2.1 Example (Nelson): The Riemannian surface of the complex function z > /z 
is two-sheeted, 7.e., its elements are the pairs {2,7} with z € @ and 7 = 1,2. The 
projections z:= x+iy of the points of M to C ~ R? can be used to introduce a 
locally Euclidean metric in M. Globally the topology is more complicated because 


one can pass through the cut {z: y = 0,2 > 0} from one sheet to the other. 
Any function ~ : M — C€ can obviously be expressed as the pair w := {y1, Wo} 
with %,(z) := %({z,7}) ; it is continuous iff the functions 7; are continuous 


outside the cut and lim,47,41mz>0[~;(z) — W3-;(—z)] = 0. Furthermore, we can 
equip M with the measure, which identifies locally with the Lebesgue measure on 
C ~ IR’, and introduce the corresponding space L?(M) which consists of (classes 
of) measurable functions ~ such that ||w||? := fe(ld1(x) |? +|yX2(x) |?) dx dy < oo. 
The subspace D := C§°(M \ {0}) then consists of the functions which have all 
derivatives continuous in the described sense and a compact support separated from 
the point 0 := {0,1} = {0,2}. We define the operators A,, Az on D by 


3) 
Ay i= {i as ie} Aw ={.% it 


it is easy to see that they satisfy conditions (a) and (c). To check that (b) is also 
valid we shall construct the unitary groups generated by the closures A;. 
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First consider the subset D, := {y% € D: suppw does not contain the x 
axes }, which is dense in L?(M) (Problem 11), and define U,(t)y := {vi, v5} 
where Yi(z,y) := ~j(x+t,y). These operators are isometric, preserve D,, and 
form a one-parameter group. Moreover, the map U,(-)w is continuous for ~ € D,, 
so {Ui(t) : t € IR} is a strongly continuous unitary group by Problem 5.45. It 
is not difficult to check that lim:.o(Ui(t)~—w) = itAi~ holds for » € D,, and 
since D, is preserved by U,(t), the operator A, is e.s.a. on it by Problem 5.46; 
the same is then true for its symmetric extension A,. In the same way we define 
the operators U2(s) on Dy := {~ € D: supp does not contain the y axes }, 
representing shifts in the y-direction, 


(U2(s){v1, va})(@,y) = {(—Ks(y))di(z, yt+s) + Ko(y)po(x, y+), 
Ke(y)vi(x,y+ 8) + (1—Ks(y))po(x, y+8) } 


with «s(y) :-= (O(y+s) — O(y))sgns, where © := Yo) is the Heaviside jump 
function. In this case, the operators U(s) can move the support of a function from 
one sheet to the other; repeating the above argument we check that Ae is e.s.a. 

We can now use Corollary 5.9.4; if we assume that operators A,;, Az commute 
it would imply that U,(t)U2(s) = U2(s)U;(t) for all s,t € IR. However, this is 
not true. Choose, for instance, a function ~ € D whose support is contained in a 
sufficiently small neighborhood of the point {1+7, 1} ; then 


U,(2)U2(2)p A U2(2)Ui(2)y, 


because the support of the function on the right side lies in a neighborhood of 
{—1—i, 1} while the left side is supported around {—1 —i, 2}, 7.e., on the other 
sheet. 


8.2.2 Remark: Pathological situations of this type can be prevented if we streng- 
then the assumptions slightly. We have, e.g., the following result (see the notes): 
let the operators A;, A» satisfy the conditions (a), (c) together with 

(b’) D is a core for A? + A? ; 
then D is also a core for operators A; and their closures commute. 


Inspired by these considerations we could try to replace the canonical commuta- 
tion relations stated above by suitable relations between the corresponding unitary 
groups, 1.e., the operators U(t) := e"”* and V(s) := e** for all s,t € IR. To cover 
the n-dimensional case at the same time, we introduce the operators 


U(t) := exp (iy) Pat , V(s) := exp Ox 
hall j=1 


for any s,t € IR”, where as usual s-t := %_,8;t;. In general, these operators 
do not commute because the right side of (8.6) is nonzero; instead, by a formal 
computation, we get the relations 


U(t)V(s) = e**V(s)U(t) (8.7) 
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(Problem 13a), which already make sense as operator identities. We call them 
Wey relations (or the Wey] form of canonical commutation relations). We expect, 
of course, that the relations (8.7) will be satisfied for the standard position and 
momentum coordinate operators. 


8.2.3 Example: Consider again the operators Q;, P, on L?(JR") of Example 7.2.3. 
The corresponding unitary groups are of the form 


(U(t)~)(x) = v(a+t), (V(s)d)(x) = e**4(z) (8.8) 


for all s,t € IR”. The second relation follows from the functional calculus; the 
first was obtained for n = 1 in Example 5.5.1b, while for n > 2 we use the 
decomposition U(t) = []f_, e*”*** , which is a consequence of Proposition 5.5.7. It is 
now straightforward to check that the operators (8.8) satisfy the Wey] relations, and 
moreover, that the operator set {U(t): t € R"}N{V(s): s € JR"} is irreducible 
(Problem 14). Notice that the maps U(-) and V(-) are strongly continuous, so 
we can say that each of the relations (8.8) defines a unitary strongly continuous 
representation of the group T,, of translations of the space JR”. 


We usually refer to these operators Q;, P, and the corresponding unitary groups 
(8.8) as the Schrodinger representation of canonical commutation relations. It 
has a privileged position among all representations of the relations (8.7). 


8.2.4 Theorem (Stone-von Neumann): Let U(-), V(-) be unitary strongly con- 
tinuous representations of the group of translations of the space JR” on a Hilbert 
space # which satisfy the Weyl relations. Then 


(a) there is a decomposition H = ®., Ha such that any of the subspaces Hq is 
invariant with respect to U(t) and V(s) for all s,t € JR", 


(b) a unitary operator S, : H, — L?(IR") corresponds to any a € / in sucha 
way that 


(SoU(t)Sz'y)x) = d(a+t), (SaV(s)Sz'¥)(z) = ef *p(z) 
holds for any L?(IR") and all s,t € IR”. 


In particular, any irreducible (unitary, strongly continuous) representation of the 
relations (8.7) is unitarily equivalent to the Schrédinger representation (8.8). 


Proof: Given U(-) and V(-) , we define a two-parameter family of unitary operators 
by 

R(t, s) := e~**/Y(t)V(s) (8.9) 
for any s,t € IR” (Problem 15). First we decompose the set { R(t,s): s,t € IR"} 


to irreducible components. For any f € L'(IR) and vectors ¢,7) € L?(JR") we 
define 


by(d,¥) = J, Flt, 5) (b, R(t, s)¥) dds ; 
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the right side makes sense because the function (¢, R(-,-)q) is bounded and contin- 
uous due to the assumption. It is easy to see that b,(-,-) is a bounded sesquilinear 
form; hence there is a unique operator By € B(H) such that b;(¢,v) = (¢, By) 
holds for all ¢,7) € H. The map f +» By defined in this way is obviously linear; 
the reader is asked to check its other simple properties in Problem 16. We shall also 
need its injectivity. 


8.2.5 Lemma: By =0 iff f(t,s)=0 holds ae. in R?". 
Proof: If By =0, then also R(—v, —u)B;R(v,u) = 0 for all u,v € R", i.e. 


[.,, Ft.8) (6, R(—v, a) R(t, 8) R(v, w)y) de ds 


= fron On?” tlt, 9) (¢, R(t, s)p) dtds = 0. 
The Fourier transformation is injective (as we mentioned in the notes to Section 1.5), 
so (¢, f(t, s)R(t,s)~) = 0 for all ¢,W € H and aa. s,t € R"”, and therefore 
f(t,s)R(t,s) =0 for a.a. s,t € JR”. Finally, using the fact that operators R(t, s) 
are unitary, we get f=0. &f 


Proof of Theorem 4, continued: Now consider the operator B := By, corresponding 
to the function fo(t,s) := (2r)-"e~+*")/4 which, in view of the lemma and Prob- 
lem 16a, is nonzero and Hermitean. Moreover, the relations (8.7) yield two other 
simple identities (Problem 17); the first of them shows that B is a projection. 

Now we are ready for the decomposition. We choose an orthonormal basis 
{ta : a €I} in RanB, and denote M, := { R(t,s)¥a: s,t € IR”} and Hy := 
(Ma)in . It follows from Problem 17b that M, 1 Mg for a # @, and therefore also 
Ha - Hg ; at the same time this gives dim(H.M Ran B) = 1, so the projection 
B}#H, is one-dimensional. Furthermore, due to Problem 16 R(t,s)M, Cc Mg ; 
hence the subspaces H, are invariant with respect to R(t,s) for all s,t € R”. 
The operator set R := { R(t,s) : s,t € IR"} is symmetric, which means that it is 
reduced by all the subspaces Ha. Suppose that there is a nontrivial closed subspace 
Ga © Ha which is invariant with respect to R ; then its orthogonal complement G+ 
in Ha is also R-invariant. By definition, operator B is in such a case reduced by 
the subspaces G, and G+, so the projections B}G, and B}G+ should be nonzero 
due to Lemma 5. However, this is impossible since their sum is the one-dimensional 
projection B}H,. 

Next we shall check the decomposition H = ®,;H,. Denote by G the or- 
thogonal complement to the right side . For a pair of vectors ¢ € G, ~ € G+ we 
have (w, R(t, s)¢) = (R(t, —s)y,¢) = 0, 7.e.,, G is also R-invariant. If G # {0}, 
then it follows from Lemma 5 that the operator B}G is nonzero; however, it is clear 
from the construction of the subspaces Ha that Ran B Cc G+. Hence we get the 
sought decomposition and the corresponding decomposition of set R to irreducible 
components. 

Let us now ask what are the relations between the irreducible representations 
obtained in this way. Consider a pair of mutually different indices a,@ € I and 
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denote #7, := R(t, s)pa, we, = R(v,u)yp ; then we can define the map Pix: 
M, — Mg by Uv?, = we ;- Due to Problem 17b this map is isometric and 
since the sets M., Mg are total in Ha, Hg, respectively, it can be extended to a 
unitary operator U : Ha — Hg. Using Problem 16 once more we get the relations 
R(t, spl), = efler-se)/ 22) zu for j = a, @, which yield the identity 


U- R(t, s) Up) = etsy 1, = Rit, sv) ; 


extending it to He = (Ma)un we get U-'Rg(t,s)U = R,(t,s), where we have 
denoted R,(t,s) := R(t,s))H,. In particular, we have U~'Ug(t)U = U.(t) and 
U-1V_(s)U = Va(s), which means that any two irreducible representations are 
unitarily equivalent; in combination with the result of Example 3 this concludes the 
proof. 


8.2.6 Remarks: (a) We have proved the irreducibility of the sets R}H. which 
are wider than the corresponding restrictions of {U(t): t € IR™}N{V(s) : 
s € RR"} ; however, it is clear from (8.9) that if G is an invariant subspace of 
the operators U(t), V(s) it is also invariant with respect to R(t, s). 


(b) In the Schrédinger representation, the relations (8.8) give 
(R(t, s)p)(x) = ef O49 Py (x +t) 


for all s,t € IR”. This allows us to express projection B and to show that it is 
one-dimensional (Problem 18). It can be seen from the construction performed 
in the proof that the number of irreducible components of a given represen- 
tation equals dim B ; this again proves that the Schrédinger representation is 
irreducible. 


Operator R(t,s) defined by (8.9) is sometimes called the Weyl operator. More 
often this name is used for 


W(t,s) := R(—t,s) = ef(*O-t?) | (8.10) 


where we write s-Q := 3°5_, 8;Q; etc., which has a rather illustrative meaning (see 
Problem 19). 


8.2.7 Example: Consider the Schrédinger representation with n = 1. Given an 
arbitrary vector ~ € L?(IR) we denote Wp := W(q,p)W. Then it follows from 
Remark 6a that qqp(x) = €'?(?*-9)/24)(~—q) ; the position and momentum appear 
symmetrically here as can be seen from the relation 


(Fygp)(k) = eW 44(2h-P)/2( Fuh) (k—p) , 


which can be checked easily by approximating ~ by functions from S(R). This 
implies the identities 


(Q) v4 = (Q)y +q, (P) bye = (Phy +p 
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for ~ € D(Q)N D(P), while the standard deviations, provided they exist, are 
independent of g,p. In particular, if we take for ~% the harmonic-oscillator ground— 
state vector Yo : Yo(z) = 17 /4e-*"/?,, then the g,» are nothing else than the 
coherent states (8.2). 

As we remarked in the notes to Section 8.1, the relation %_» := W(q,p)w also 
defines coherent states for other ~ € L?(IR). The map (q,p) + Wqp is continuous 
by Problem 14b, so to prove the claim we have to check the relation 


(2) = [ao 9) beo(a) da dp ; (8.11) 


this is not difficult as long as the vectors ~,¢ belong to S(JR") and ||y|| = (27)-1/? 
(Problem 20). 


8.2.8 Remark: Let us see what Theorem 4 implies for the original form of canonical 
commutation relations. If the unitary—-operator—valued functions U(-) and V‘(-) 
satisfy its assumptions, we have 


U(t) = 32° Sz'Us(t)Sa, V(s) = 32° Sz1Va(s)So 
ace! aél 


for all s,t € IR”, where Usg(-) and Vgs(-) are defined by (8.8). The properties of 
the ni (32) representation mentioned in Example 3 imply that the operators 


P, = -t i (0) and Q; :=-i (3) (0) have the following properties: 


(a) there is a common dense invariant subspace, e.g., D := 1%, S51S(R"), 
(b) operators P, and Q; are e.s.a. on D, 
(c) [Pe; Q;]b = —i6;. holds for any ~ € D. 


On the other hand, in the same way as in Example 1 these conditions do not imply 
that the unitary groups associated with P,, Q; satisfy the Wey] relations (Prob- 
lem 21b). As in Remark 2, a sufficient condition is obtained if we replace (b) by 


(b’) the operator 7_,(P? + Q?) is e.s.a. on D; 


notice that in the Schrédinger representation the last condition is valid, e.g., for 
D := S(IR") (Problem 21a). 


We can naturally ask whether Theorem 4 may be extended to situations where 
the number of canonical pairs (degrees of freedom) is infinite. A more general form of 
Wey] relations is obtained if we replace JR” by a real Hilbert space S with the inner 
product (-,-) ; we are then looking for unitary strongly continuous representations 
of the group of translations of S such that 


U(t)V(s) = eV(s)U(t) (8.12) 


for all s,t € S. In the case of a finite-dimensional S , Theorem 4 can be applied, 
since S is then topologically isomorphic to IR” by Theorem 4.1.1. On the other 
hand, no analogous result is valid if dimS = oo (see the notes). 
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8.3. The classical limit and quantization 


Since our global picture of the physical world should be free from contradictions, 
quantum theory has to reproduce the results of classical physics when applied to 
large objects. It is therefore useful to investigate how the predictions of quantum 
theory look in such situations, which are usually referred to briefly as the classi- 
cal limit. We mentioned in Section 8.1 that the results of quantum and classical 
descriptions have to match in cases when quantities of the dimension of action are 
much larger than the Planck constant. It suggests that a suitable mathematical way 
to treat this problem is to study how quantum systems behave in the limit h - 0. 


8.3.1 Example: The classical harmonic oscillator of mass m and angular frequency 
w moves periodically with the amplitude A := (2E/mw*)'/?, where E is the total 
energy; the region outside the interval [—A, A] is classically forbidden. For compar- 
ison with the quantum oscillator we can therefore use the probability of finding the 
particle in a given interval (at a randomly chosen instant), or more generally, the 
mean values of the quantities f(x) (functions of the particle position) with respect 
to the corresponding probability measure. The latter is easily found; the probability 
density of finding the particle at a point x is inversely proportional to its velocity, 
which yields 


(faye =f 2 ae. 
-A mV A?— x? 
In particular, for f := x, the formula yields the classical probability wa(J,2 : E) 
of finding the particle in the interval J provided it has the total energy E. 
Now consider the quantum harmonic oscillator of Example 7.2.4. Suppose, e.g., 
that the function f is measurable and polynomially bounded; then the mean value 
of the observable f(Q) in the state 7%, is by (Q2c) equal to 


Doe = pry fe fee) ta (\[) mea. 


To be able to compare the two cases we have to keep in mind that the energy4 
eigenvalue corresponding to state ~, depends on fA ; hence it is necessary to perform 
simultaneously a suitable limit n — oo. We choose, for instance, the sequence {hn} 
in such a way that $h,w(2n + 1) = E and see how the quantity 


(F£(Q))anitin = een a (st Hg) e a dy 


behaves in the limit n > oo. Choosing f : f(x) = cos(kx) we get 


pa? aA 
1 k ages 
jim, (cos( Oa = lim oe ( 5) exp ( c“) Jo(kA), 


where Jo is the first-order Bessel function (Problem 23). On the other hand, 1 
have (cos(kx))~ = Jo(kA), and therefore 


tim, (f(Q))entin = (F(2))e (8.1: 
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for f(x) := cos(kx). The same is true trivially for the odd function f(r) := 
sin(kx) , and thus also for f(x) := e'** ; we shall show that the relation (8.13) is 
valid for any f , which is the Fourier transform of some if € L1(R), in particular, 
for all f € S(IR). Under this assumption the Fubini theorem gives (f(xr))z = 
(2r)-!/? fp f(k)Jo(kA) dk. On the other hand, the same theorem can be applied to 
express the quantum mean value, 


Jim (F(Q))onne = fits, ee [ARF re | exp ( “eat | Haye dy. 


The integrated function in the outer integral is majorized by | if (-)| and tends to 
f(-)Jo(-A) as n — oo ; the result then follows from the dominated—convergence 
theorem. 


In a similar way we can deal with the classical limit for other simple systems 
(Problem 24). A more complicated problem is to describe how quantum systems be- 
have when they approach the classical limit, i.e., how their wave functions, eigenval- 
ues, etc., depend on the value of the Planck constant; we usually speak of the semi- 
classical approximation. A straightforward approach, the formal version of which 
is well known from quantum mechanical textbooks, is based on expansion of the 
considered quantities into series in powers of fh ; it can be put on a mathematically 
sound basis, but it requires a certain effort. We give some references in the notes; 
the problem will be mentioned again in Section 9.4. 

A converse to the classical limit is represented in a sense by quantization 
which is, roughly speaking, a search for the quantum description of a system based 
on knowledge of its classical counterpart. This problem has an obvious heuristic 
motivation: if we are looking for operators to represent observables of a system 
having a classical analogue, it could be useful to have a prescription for constructing 
them starting from the corresponding classical quantities. On the other hand, the 
importance of a quantization as a physical method must not be overestimated; the 
ultimate criterion of adequacy of the quantum model obtained in this way is the 
correctness of its predictions. 

Consider a classical system with phase space JR", in which the canonical coor- 
dinates q1,.-.,Qn)P1,---;Pn are introduced. The starting object is a suitable family 
F of functions f : IR — IR which could be required, for example, to obey the 
following conditions: 


(cl) F includes the trivial observable, 1(q1,...,@n,P1,---)Pn) := 1 for any 
(iy Pees Pi) € JR", as well as qj: hGis »« Gay Pimang Py) ay 
and pe: De(Q1,+++)Qn)Pir+++>Pn) = Pk 


(c2) F is a real vector space; in particular, it contains the (2n+1)-dimensional 
subspace Fy := { f = a1 + Df.1(895 + 505) : a, 8j,9, € R}, 


(c3) F is a Lie algebra (usually an infinite-dimensional one) with respect to the 
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product {-,-}p defined by the Poisson bracket, 


_ oe (Of 99 — og sh) 
tighe = X (en Op; Og; Op; 


using the terminology of Section 10.3 we can say that its subalgebra F} is just 
the Heisenberg—-Wey] algebra. 


The conditions are satisfied for various families of functions, for instance, the set 
Won of all inhomogeneous polynomials in the variables q;,p,, the set C®(IR™) of 
all infinitely differentiable functions, etc. 

The most common quantization procedure, which is usually referred to as Dirac 
quantization, consists of mapping this algebraic structure to a suitable family of 
self-adjoint operators: given a set F' of classical observables we look for a Hilbert 
space H and a linear map J: F — L,,(H) such that 


(dl) J({f,g}p) = (@h)-*[J(f), J(g)] holds for all f,g € F, 
(d2) J(1) =I, i.e., the unit operator corresponds to the trivial observable 1, 
(d3) the set { J(q;), J(pe) : j,k =1,...,n} is irreducible; 


the inclusion F > F; shows that a part of the problem is to construct a represen- 
tation of the canonical commutation relations. This means, in particular, that the 
set J(F) cannot consist of bounded operators only, and condition (dl) has to be 
understood as being valid on a common dense invariant subspace. 

We know from the preceding section that amap J with the properties (d1)—(d3) 
exists if F' = F,, and even that it is not unique unless we add other requirements 
such as those discussed in Remark 8.2.8. However, the set of classical observables 
also contains other quantities, such as the kinetic and potential energy, angular 
momentum, etc., which should be included into F’. Unfortunately, it appears that 
F, cannot be extended too much. 


8.3.2 Theorem (van Hove-Tilgner): There is no Dirac quantization on the sets 
CUR") and Way. 


References to the proof are given in the notes. Here we limit ourselves to hinting 
why such a map J cannot exist; for simplicity we consider the case with a — 
degree of freedom, n = 1. Suppose that the self-adjoint operators Q := J(q), P 

J(p) correspond to the classical observables gq, p, respectively; then the ‘elena 
(n+1)q” = {q"t',p}p together with condition (dl) implies Q* = J(g") for all 
positive integers n and the analogous relations for P"™. Furthermore, the relation 
(n+1)q"p = {q"t',p?}p and (d1) give J(q” ie $(Q”P+PQ") and similar relations 
for J(qp”). Now consider a — q’p”, which has more than one Poisson— 
bracket expression, e.9., q?p = 1{q Pp 198 p,p’}. Using the first expression, 
after an elementary computation we get from (d1) 


J(q?p*)p = [Q?P? — 2ihQP + a(-ih)*|p 


8.3. THE CLASSICAL LIMIT AND QUANTIZATION 309 


for any vector 7% of the common invariant domain where a = 2/3, while the second 
expression yields a similar relation with a = 1/2 (Problem 25). 

As we shall see in a moment, there are sets W., D F' D F, for which a Dirac 
quantization can be constructed. However, these sets are too small to meet practical 
needs; it is therefore more reasonable to change the original question abandoning 
the requirement (c3) on the family of classical observables. 


8.3.3 Example (practical quantization): For simplicity we shall again consider the 
case n = | ; an extension to systems with any finite number of degrees of freedom 
is easy. The subsets 


Fy = | > ang’ p* : aj € r} » Fy := {91(9)p + 90(g) : go, 91 € C~UR) }¥W 
j+k<2 


in C®(R?) satisfy conditions (cl)—(c3); we define on them the maps 


In 4( 5 on = 5D anlQPr+ Ph), 


j+k<2 j+k<2 


I 


Jy: Jy(u(@p + .90(0)) = 3(91(Q)P + Pax(Q)) +.90(0), 


where Q,P are the position and momentum operators in the Schrédinger repre- 
sentation restricted to a suitable dense invariant subspace D Cc L*(JR). We can 
choose, e.g., D = C§°(UR) ; in the first case S(JR) can also be used, while for J, 
it is generally not invariant if the functions g; grow too fast. It is straightforward 
to check that the two maps satisfy conditions (d1)—(d3), and therefore each of them 
could be considered as a Dirac quantization — up to the fact that the obtained op- 
erators are in general only symmetric. A more elementary and substantial reason, 
however, why neither of them is satisfactory is the following: they do not contain a 
typical energy observable of the form p*+u(q) with exception of the case when v 
is a quadratic polynomial. 

On the other hand, Je(f) = J,(f) holds for any f € FM Fy, so we are able 
to construct a common extension. We take the subset F2, in C™(R) spanned by 
Fy Fy, t-€., Fog = { gop? + 7(9)p + 90(p) : 92 € R, 91,90 € C~(IR)}, and define 
the map Jo, by 


ala) = 92P? + 5(91(Q)P + Pou(Q)) + 90(@)- 


The set Fo, already does not satisfy condition (c3): we have seen, for instance, that 
{q?p, p?}p = 6q?p? ¢ Foq. If the functions f,g € Fo, are such that {f,g}p € Faq, 
then, of course, condition (d1) is satisfied; for this reason the map Jog may be 
called a practical quantization. Notice also that the operators obtained in this way 
are symmetric but in general not e.s.a. (see the notes). 


To illustrate that the quantization procedure described here has a rather heuris- 
tic meaning, consider again a typical Hamilton function, h(q,p) := p?/2m + v(q) 
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with v € C°(IR). We ascribe to it the operator H := Jz(h) = (2m)-! P?+ v(Q). 
However, we also postulate the same form of the Hamiltonian for other potentials 
including those for which function v has discontinuities and other singularities; 
we care only about the essential self-adjointness of the resulting operator. At the 
same time, the relations between classical and quantum dynamics include some deep 
problems — see the notes. 


Notes to Chapter 8 


Section 8.1 In Example 7 we again encounter the coherent states already mentioned in 
Sec.2.2. This name is used for various subsets C := {wz : z € I} of the state space H, 
where J is an uncountable index set equipped with a suitable topology and measure. It 
is required that 


(i) the map z++ yw, is continuous, 
(ii) a relation of the type (2.2) is valid: w= f;(Wz, wv). dz forall we H. 


The family (8.2) of coherent states can be obtained from the harmonic—oscillator ground— 
state vector %% by means of the operators of the Heisenberg-Weyl group representation 
— see Example 8.2.7. If we replace Wo by another vector y € L?(JR) we obtain another 
family of coherent states which are no longer minimum-—uncertainty states and cannot be 
represented by analytic functions. 

The family (8.2) was first noted by E. Schrédinger in [Schr 1]. The notion of a 
coherent state was formulated at the beginning of the sixties in the papers [ Gla 1,2], 
[Kla 1]; the Hilbert space of analytic functions, which is closely related to the states (8.2), 
was introduced at about the same time — see [Ba 1,2]; [Seg], Chap.VI, and [Seg 3]. 
The mentioned coherent states are often called canonical because they are determined 
by the HW-group related to the CCR. Coherent states can also be constructed for other 
groups; a well known example are the spin (or Bloch) coherent states in the spaces 028+! 
which were constructed in [Rad 1] using the appropriate representations of the group 
SU(2) ; the construction of coherent states for a general Lie group can be found in [Per 1}. 
As we have already noted (and as we shall see again in Sec.9.3) the canonical coherent 
states are naturally associated with harmonic oscillator potential; similar “coherent” states 
corresponding to a wider class of potentials have been studied in [NSG 1]. 

Canonical coherent states were introduced by R. Glauber for purposes of quantum 
optics; a little earlier they were defined by J. Klauder as an “overcomplete system of 
states”, as a tool to define the Feynman integral — see [Kla 2] and also [KD 1]. As 
was already noted by E. Schrodinger, coherent states are useful for investigation of the 
classical limit of quantum mechanics; see also, e.g., [Hep 1], [Lie 1], {Hag 1]. In addition, 
they have physical applications which are too numerous to be listed here; a representative 
overview is provided by the book [KS], where many important papers are reprinted. 

Uncertainty relations are probably discussed in every quantum mechanical textbook; 
sometimes they are illustrated by an argument adopted from classical optics whose rigorous 
form can be found in [Wil 1]. The fact that standard deviation is not always the best way 
to characterize how the state is localized has inspired various modifications. For instance, 
given numbers a, 6 € [1/2,1) we can define (6°Q)y := inf{b—a: w((a,),Q;~) = a} 
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and (gQ)y := inf{|a|: fp v(x)v(x—a) dx = B} (cf. Problem 8) and the analogous 
quantities for operator P. They have an obvious meaning; it is illustrative to inspect 
what they look like if the support of the function w consists of two intervals of a length | 
at a distance L >. Various inequalities between such “uncertainties” have been proven 
in [HU 1], [UH 1,2], [Pri 2]. 

Theorem 8 was proven in the paper [Ba 4] using the partial-wave decomposition 
L?(IR") = @z2o He, where He are eigenspaces of the generalized angular momentum (cf. 
Section 11.5). In this way, a stronger result is obtained: if ~ € He, then n+ yp in the 
inequality (8.3) can be replaced by n+ +2. A generalization to the case when QHUt! 
is replaced by g(Q) with an absolutely continuous g: IR*+ — MRR is given in [Ex 2]. A 
concise proof of the second of the inequalities (8.4), which is sometimes referred to as the 
uncertainty principle lemma, can be found in [RS 2], Sec.X.2. 


Corollary 10 is adopted from the paper [Far 1], which also contains an alternative 
proof showing that the bound can be improved for 3 < n < 5, when the constant C, can 
be replaced by 4/m?, and also an example illustrating that the assertion is not valid for 
n= 2. It is elementary to check that for n = 1 it is not valid either; in that case we can 
prove a similar relation with the right side depending linearly on (AP)y (Problem 9). 
More general inequalities of that type can be found in [Pri 1,2]. 

Theorem 8 and Corollary 10 represent examples of results sometimes called local un- 
certainty principles . They are deeper than the standard Heisenberg inequalities because 
they show that localizing a particle (or a system of particles) sharply at any point of the 
space is possible only at the expense of raising its kinetic energy. This increase may or 
not be compensated by potential energy; this is the core of the fundamental problem of 
stability of matter about which we shall say more in the notes to Sec.14.3. 


Section 8.2 Example 1 comes from the paper [ Nel 1]; see also [RS 1], Sec.VIII.5; 
{ Thi 3], Sec.3.1; basic facts about Riemannian surfaces can be found in most complex— 
analysis textbooks — see, e.g., [ Sab ]. In the mentioned paper E. Nelson also proved a 
sufficient condition for integrability of representations of Lie algebras — cf. also [BaR], 
Sec.11.5 — the assertion quoted in Remark 2 is a particular case of this result for the 
two-dimensional commutative Lie algebra. The idea is the following: we can expand the 
involved unitary groups into power series and employ the commutation relations between 
the generators. This makes sense provided the operators are applied to a common analytic 
vector; the type (b’) condition ensures the existence of a dense set of such vectors. Another 
particular case of Nelson’s theorem is given in Remark 8; see also [Di 1]. A simpler example 
of operators Q, P which satisfy the relations (8.6) on a dense set but (8.7) does not hold 
for the corresponding U(t), V(s) can be found in Problem 12; however, in that case 
condition (b) is not valid. 

Canonical commutation relations were put in the form (8.7) by H. Weyl in 1927; the 
result concerning uniqueness of their irreducible representation, i.e. Theorem 4, was first 
proven in the papers [Sto 1] and [vN 1]. Our proof essentially follows the original von 
Neumann argument which is reproduced, e.g., in [| Pru], Sec.IV.6; other modifications 
and generalizations can be found in [ Hol], Sec.V.3; [ Thi 3], Sec.3.1; [Kas 1], ete. A 
somewhat different proof, which relies on analytic vectors of the operator P?+ Q?- TI, 
is sketched in [ RS 2], Probl.X.30. Theorem 4 represents a particular case of a general 
result proven by G. Mackey within the so-called imprimitivity theory — see, e.g., [BaR], 
Sec.20.2; [Ja], Sec.12-3; [ Var 2], Sec.11.3. Properties of operators of type i[f(P), 9(Q)] 
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in the Schrédinger representation are discussed in [Ka 7]. 

A discussion of the problem of nonequivalent representations of the relation (8.12) 
for dimS = oo can be found, e.g., in [Seg]. Generally speaking, there are plenty of 
them. If we consider, for instance, operators describing a free scalar field together with 
the canonically conjugate momenta, then any two families of such operators corresponding 
to fields with different masses define nonequivalent representations of (8.12), and moreover, 
representations corresponding to the free and interacting field are also not equivalent — 
we shall return to this question in Sec.12.3. 


There we shall also encounter canonical anticommutation relations (CAR). In this case 
the situation is easier because only bounded operators are involved. The anticommutator 
is conventionally defined as {B,, Bj} := B,B2+B2B, ; we say that bounded operators 
B;, j =1,...,N, satisfy the CAR if {B;, B.} = {B?, By} =0 and {B}, By} = b;41 . If 
we look for a representation of these relations it is useful to pass to 2N unitary operators 
Uo; = B; + B; and Ugj-1:= i( B} — B;), which satisfy ug =vieand Uglinge Url w= 0 
for n#m. For any positive integer N one can construct an irreducible 2 -dimensional 
representation of the CAR which we denote as U (NY) (Problem 22). A counterpart to 
Theorem 4 is provided by the Jordan-Wigner theorem: If N < oo, any irreducible repre- 
sentation of the CAR is unitarily equivalent to U‘%) — see, e.g., [Si 1]. 


Section 8.8 In addition to the correct classical limit, which is required if the quantum 
theory is to be consistent with classical physics, the former has numerous implications 
for macroscopic systems which the latter can register but in no way explain. Recall the 
problem of stability of matter mentioned above, which suggests that the mere existence 
of our esteemed reader is a quantum effect to which classical physics has little to say, and 
a fortiori, the fact that he or she is living and is reading this book represents a quantum 
phenomenon so deep that nobody is able to render its full explanation. 

The formal scheme of the semiclassical approximation or WK B-method in which loga- 
rithm of the wave function is supposed to be a meromorphic functions of the “variable” 
h at the vicinity of the point 4 = 0 is known from textbooks. A detailed discussion 
together with many applications can be found in [BM 1]; see also [ Mas], | MF], and 
[ Vai]; one way in which the WKB expansion can be made rigorous is described in [KIS 1]. 
Semiclassical approximation has been object of intense mathematical interest. We have 
already touched the problem in the notes to Secs.6.7 and 8.1, see also [GMR 1]; another 
approach is used in e.g., [CDS 1], [CDKS 1], and yet another one will be mentioned in 
the notes to Sec.9.4. Let us remark that there are situations where semiclassical analysis 
can be used; however, the expansion parameter has another physical meaning — as an 
example, the cascading phenomenon studied in [GGH 1] can be quoted. 

The idea of quantization procedure goes back to P. Dirac — cf. [ Dir], Sec.IV.21. 
The proof of Theorem 2 for F = C™(iR) is given in [vH 1] or [ vH J, Sec.23; for 
F = Wo, it can be found in {Til 1]. Notice that operators resulting from practical 
quantization are generally only symmetric; an example is provided by A := P?+ Q?—Q?, 
which is not e.s.a. even on Co°(IR) — see [RS 2], Sec.X.5. As we have mentioned, 
an extension of the map Jz, is possible at the expense of losing the bijective correspon- 
dence between the Poisson bracket of the classical observables and the commutator of the 
corresponding operators. Nevertheless, different extensions have been constructed. They 
can be distinguished by the rule, which ascribes an operator to the quantity q/p* , for in- 
stance, 3(Q/P*+ P*Q?) (symmetrization), (j+1)7! 32,_) Q™P*kQi-™ (Born-Jordan), 


m=0 
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2-3 3? 9 (A) Q™PFQI-™ (Weyl-McCoy), etc.; an overview is given in [Wol 1]. 

More information about various quantization methods can be found, e.g., in [BFF 1] 
or [ Hur]. Physically this problem is usually motivated as an attempt to find a way 
to produce a correct quantum description of more complicated classical systems (with 
constraints, a nonflat configuration space, field theories, etc.) but no convincing exarnple 
has been found up to now. At the same time, the problem of quantization has been a 
source of inspiration for mathematics. Moreover, there are deep relations between the 
spectral properties of quantum systems and the phase space behavior of their classical 
counterparts which are not yet fully understood. For example, in so-called integrable 
systems the phase-space trajectories consists of families of tori, while in chaotic systems 
they are rather irregular; in their quantum counterparts the difference is manifested in 
different distributions of energy-eigenvalue spacings and similar quantities. However, a 
discussion of this problem goes beyond the scope of the present book; we refer to [Zy 1], 
[Seb 1] for further reading. 


Problems 


1. Let Aj,...,An be compatible observables. Show that if A; € o(A;), j =1,...,n; 
then for any € > 0 there is a pure state we such that |A; — (A;)y,| < € and 
(AA;)y, < 2€ holds for 7 = 1,...,n. In a similar way, generalize assertion (b) of 
Proposition 8.1.1. 


2. Under the assumptions of Theorem 8.1.2, (AA1)?, + (AAo)%, > |Tr (é[A1, Ao]W)|. 
In particular, operators Q;, P, on L?(IR”) satisfy (AQ;)% + (AP,)%, > 6jx for any 
we SUR”). 

8. Let A;, Ag be unbounded self-adjoint operators; then C' :=i[A:, Ag] may have no 

self-adjoint extensions. 
Hint: Consider operators Hpo,-, Q of Example 7.2.2. 
. Show that (AQ)w of Example 8.1.5b is the standard deviation of the random variable 
1/2 
( aTOyi= (Q;)w)?) : with respect to the probability measure w(-,Q;W) from 
Example 7.5.4a, and that the analogous conclusion is valid for (AP)w . 


MN 


5. Any minimum-uncertainty state is a mixture of pure minimum—uncertainty states. 


6. Prove: (a) The relation (8.1) defines a minimum-uncertainty state. 
(b) Check that the vectors (8.2) coincide with (2.3) for w := 271/2(q—ip). 
(c) Find the minimum-uncertainty states for operators Q;, P, on L?(IR"), i.e., 
functions ~, which satisfy the relation (AP)y(AQ)y = 3. 
7. Complete the proof of Theorem 8.1.8. 
(a) Show that the function Weg: We,a(x) = ef” wz) with w € SUR”) and ¢,8 
positive belongs to S(IR") and “*)(0) =0 for k =0,1,2,.... 
(b) Check that Q%ve,g - Q* as e + 0+ provided a > —n/2. 
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(c) Check that | ||Pjte,all?—- ||P;|]?| + 0 as € + 0+ provided 6 < 1/2. 
(d) For which values of 4 can the norms on the right side of (8.3) be written as 
mean values? 


8 Given w € L?(R) define f(y) := Jp ¥(x)¥(a—-y) dx. The function f is continuous, 


satisfies |f(y)| < f(0), and limp oo f(y) = 0. 
Hint: The shift operators form a strongly continuous group. 


9. Consider operators Q, P on L?(R). Prove: 


(a) if n =1, the inequality (8.5) holds for no constant C;, 
(b) let » € D(P?) and M € B; then w(M,Q;~) < £(M)(AP)y where £(M) is 
the Lebesgue measure of M. 


Hint: (a) If n =1 the functions of D(P?) are continuous. (b) Prove 3( f(Q)y < 
\||(P— (P)all ||f(Q)w|| for an absolutely continuous f and choose the function ap- 
propriately with Ran f c [—1, 1]. 


10. There is no pair of bounded operators Q, P which would fulfil [P,Q] = —7/. 
Hint: Estimate the norms of the left side in [P, Q"] = —inQ™"!. 


11. Prove that sets D, and D, of Example 8.2.1 are dense in L?(M). 


12. The operators Qa, Po from Example 8.1.6 satisfy [Pa,Qa]f =—if for all f for 
which the left side makes sense but the corresponding operators Ua(t) := e'**! and 
Va(s) := e*%2* do not satisfy the Wey] relations. 


13. Prove: (a) The Wey! relations follow formally from (8.6) together with the expansion 
of the operators U(t), V(s) into power series. 


(b) The argument can be made rigorous in the Schrodinger representation provided 
the operators are applied to vectors from C§° (JR). 


Hint: (a) Use the Hausdorff-Baker-Campbell formula (which is valid for bounded 
operators): e'4 Be“4 = oe, S C*, where Co := B and Cy :=[A,Cx_1]. 


14. Consider the operators U(t), V(s) defined by (8.8). Prove that 


(a) they satisfy the relations (8.7), 

(b) the maps U(-) and V(-) are strongly continuous on R”, 

(ChefU(t): te HP} = (Pi,..., Re omdeiVG)casedn”}’ cali... a@a)". 
15. Under the assumptions of Theorem 8.2.4, the operators R(t, s) are unitary, the map 


(t,8) + R(t, s) is strongly continuous, and R(t, s)R(v,u) = e¢4-”)/2R(t40, stu) 
holds for all s,t, u,v € JR”. 


16. Let By be the operators defined in the proof of Theorem 8.2.4. Show that 


(a) (By)* = By, where f*(t,s) := f(—t,—s), 
(b) By, By, = By,,, where fy2(t, 8) := fan fi(t—v, s—u) fo(v, u) eft#-$-)/2¢y du. 
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17. Consider the operator B:= By, from the proof of Theorem 8.2.4. Using the result 


18. 


£9. 


20. 


oi. 


Be. 


of the previous problem show that 


(a) BR(t,s)B = e-+#*)/4B for any s,t € JR"; in particular, B? = B, 
(b) (R(t, s)¢, Rv, u)p) = e~(-2)?/4-(s-u)? +i(e-v-t-4)/27% ) for all ¢, € Ran B 
and any s,t,u,v € JR”. 
Hint: Use the formula fyn e~€7/2)+€-g¢ = (2n)"/2e-7"/2 | 


Prove that the projection B from the proof of Theorem 8.2.4 is expressed in the 
Schrédinger representation as 


(By) (a) = x"? fe Ay (y) dy. 


Show that the condition By = ~ is satisfied only by multiples of the vector wo : 
io(x) = 1-7/4 e-27/2 | 

Hint: Use the orthonormal basis (2.1). 

Prove that the Weyl operator satisfies the relation (8.10). 

Hint: The operator A := s-Q-t-P is e.s.a. on S(IR”) ; check that W(rt,rs) = eiAr 
holds for all re JR. 


Consider the vectors gp := W(q,p)~ of Example 8.2.7. Let ¢, ~ € SUR) with 
lll] = (27)-1/2 and y € L?(R). Prove: 


(a) limeo+ Spe €-°™ (Wa,p, $) Yaio(2) dg dp = 2n|\4|(?9(2) , 

(b) use this to prove (8.11) under the stated assumptions, 

(c) prove the “weak form” of (8.11): (x, ¢) = fin2(X; Yap) (Wap, ¢) dg dp . 

Prove: (a) Let Q;, P, be the operators on L*(JR”) from Example 7.2.3. The 
operator H := 77_,(P? + Q%) is self-adjoint and S(JR”) is a core for it. 


(b) Find an example of operators P, @, which satisfy conditions (a)—-(c) of Re- 
mark 8.2.8 but the Weyl relations are not valid for the corresponding unitary 
groups. 


Hint: (b) In the setting of Example 8.2.1, define Py := —idy/Ox and Qy := 
xy —idw/dy for any ye D. 

Define the operators ey) with n=1,...,2N on the N-fold tensor product 
C? @---@C? in the following way: 

(i) US) :=0n if N=1, 

(it) U2 := 01 @on, n=1,2,3, and U?) :=03@1 if N=2, 

(ii) UMM) := 0, @UN-”, n = 1,...,2N—-2; UMD, := 01 @ 02 @ I) and 

UN) := 03 @ I) if N>3, 

where the o; are Pauli matrices. Prove that these operators form an irreducible 
representation of canonical anticommutation relations (cf. the notes to Sec.8.2). 
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23. Compute the integrals and the limit in Example 8.3.1. 

24. Consider the classical limit for the one-dimensional system described by the square— 
well Hamiltonian of Example 7.2.5, H : Hy = —(h?/2m)(d?p/dzx?) — Vox|-a,0)¥- 
Take the sequence {fin} with hn := 4 ./2m(E + Vo)a® and prove that 

(a) limpoo w(J,Q3 Yn, fin) = Wei(J, 2; E) =  &(JN[—a, a]) holds for any interval 
J C RR, where & stands for the Lebesgue measure, 

(b) the limit of the probability density dw((—0o, 2], Q; Wn, fin)/dx as n — oo does 
not exist in the classically allowed region, |x| <a. 


25. Find the general form of operator J(q"p™) in a Dirac quantization J. 


Chapter 9 


Time evolution 


9.1 The fundamental postulate 


It is a certain idealization to speak about a state at a given instant. We have sup- 
posed up to now that the measurements which determine the state are instantaneous 
processes; however, any real measurement has a finite duration. This fact should be 
remembered: we expect a quantum—mechanical model to yield correct predictions if 
the time scale characteristic for the effects under consideration is much longer than 
the duration of the appropriate measurements. 

Other complications may arise if we study relativistic systems since their evo- 
lution differs when observed from different reference frames. Recall the well known 
example of muons in secondary cosmic rays which reach the Earth’s surface only due 
to the fact that their decay —- which is a purely quantum process — runs at a much 
slower pace for us than in their rest system. This does not mean, however, that 
there is a fundamental difference between relativistic and nonrelativistic systems; 
the description of time evolution we are going to discuss below is universal, at least 
as long we use a fixed reference frame. On the other hand, some notions which are 
useful in nonrelativistic quantum mechanics cannot be transferred simple—mindedly 
to relativistic quantum field theory; we comment on that at the end of the next 
section. 

To begin with, we introduce the notion of a unitary propagator, which is a 
family {U(t,s) : s,t € IR} of unitary operators on a Hilbert space H such that 


Gi) Uées)U (o;r)o= U(t,r)mioldstiomeany rys,tee JA, in particular, U(t,t)=u 
foreach t€ R, 


(ii) the map (s,t) +» U(s,t) is strongly continuous in IR?. 


A statistical operator denoting a state of the system at an instant t will be denoted 
as W, ; similarly we use the symbol y, for a vector describing a pure state at time 
t (if necessary we also write W(t), ~(t), etc.). 

Suppose that the evolution of the system is not disturbed in a time interval J, 
i.e., one performs no measurement during this period. Then we postulate 
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(Q4a) the time evolution of any state of the system is described by a unitary 
propagator, W, = U(t,s)W,U(t,s)~! or % = U(t,s)y, for any s,te J. 


Let us mention briefly how we can motivate the unitarity requirement for the opera- 
tor connecting states at different times. For simplicity, we consider pure states only; 
we assume that at an instant s the state is described by a ray ®,, which evolves 
during [s,t] into ®,, provided it is undisturbed by a measurement. We also forget 
for a moment about superselection rules, in which case any ray can serve as the ini- 
tial state and the relation ©; = U(t,s)®, defines a bijective map in the state space. 
This makes sense also for t < s, when ©, means the state from which ®, has 
developed during [t,s]. Experience tells us that we may suppose that the transition 
probabilities between states are preserved during an undisturbed time evolution, 


P(®,,V,) = P(U(t,s)®,,U(t, s)V,) 


for any ®,, Y,. Then it follows from Wigner’s theorem (see the notes) that there is 
a unitary or antiunitary operator U(t,s) such that q%, := U(t,s)y~, belongs to V; 
for any 7), € W, ; this operator is determined by U(t,s) up to a phase factor. Since 

U(t,s)U(s,r)y, and U(t,r)a, represent the same state, they differ at most by a 
phase factor which is, moreover, independent of 7, in view of the (anti)linearity of 
the operators U, i.e., we have U(t,s)U(s,r) = e"*9U(t,r). Using further the 
associativity of the Jail: multiplication we find 


a(r, s,t) in A(r, s) ate A(s, t) ia B(r, t) ? 


where ((s,t) := a(0,s,t), and therefore we can pass to the operators U(t.s) := 
e~*4(s.) U(t, s) , which correspond to the same U(t,s) and already obey condition (i). 
Combining this property with the natural continuity requirement we see that the 
operators must be unitary (Problem 1). 

The family { U(t,s): s,t € IR} is mostly called briefly a propagator; the term 
evolution operator is also sometimes used. It follows from the postulate (Q4a) that 


TrW? = Trw? (9.1) 


for all s,t € J ; recall that this quantity tells us “how much” the state W; is mixed. 
In particular, a pure state evolves into a pure state again. 

A system is called conservative if its propagator satisfies U(t+r, s+r) = U(t, s) 
for all s,t,7 € JR. For such systems we define U(t) := U(t+ 7,7), where 7 is 
any fixed time instant; the postulate (Q4a) then means that {U(t): t € IR} isa 
strongly continuous one-parameter group of unitary operators. Due to the Stone 
theorem, this group is generated by a self-adjoint operator A. The fundamental 
dynamical postulate of quantum theory consists of identifying the operator —A with 
the Hamiltonian of the system (or with A7'H in the standard system of units): 


(Q4b) the propagator of a conservative system with the Hamiltonian H is given 
by U(t) = e*#* for any te R. 
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An easy consequence is that a quantum system with a time-independent Hamilto- 
nian is conservative. This property is typical for isolated systems; however, those 
which interact with the environment in a time-independent way also belong to this 
class. Furthermore, time evolution of conservative systems has a simple differential 
expression. 


9.1.1 Proposition (Schrédinger equation): If W,D(H) C D(H) is valid at some 
instant s, then the function t + W,¢ is differentiable for any ¢ € D(H) and obeys 
the equation 

d 

ae Wid = [H, W,]¢ 4 

In particular, if %, € D(H) for some s, then t+ 7, is differentiable and 


asl 
tae = Hy;,. 


Proof: The pure-state part of the task is obvious. For any t € R we have 
W,D(H) = U(t—s)W,U(s—t)D(H) Cc D(A) since U(r) as a bounded function 
of H maps D(H) into itself. We have 
d a, U(-6)-I U(6)—I 
ge = Im {eo = 


p+ wie | 


for any ¢ € D(H) ; the limit of the second term exists and equals —iHW;:y due to 
Proposition 5.9.1. As for the first term, we get 


U(—6)—I 
6 


U(6)W, = iwitt6| 


< juceywme) | “—T 9 — ata + Wo@—nWHel, 


and since ||U(5)W,|| <1, the right side tends tozeroas6— 0. J 


Using the standard argument from the theory of ordinary differential equations, 
we can check that for a given initial condition Yo € H, the Schrédinger equation has 
just one solution, namely 7%, = e~*#*yo ; hence in the case of a conservative system it 
uniquely determines its unitary propagator. This correspondence is useful when we 
pass to nonconservative systems whose Hamiltonians are time-dependent. Instead 
of attempting to modify (Q4b), in this case we directly postulate the Schrodinger 
equation: 


(Q4c) the time evolution of a system whose Hamiltonian is generally time-—depen- 
dent is determined by the equations 


iL Wid = [HO,WiId, ite = Hee 


for the mixed and pure states, respectively. 


320 CHAPTER 9. TIME EVOLUTION 


This is clearly consistent with (Q4b); however, we also have to check that the stated 
equations make sense and define a unitary propagator for the considered operator— 
valued function t +> H(t). This represents a nontrivial problem; we shall discuss it 
for some classes of time-dependent Hamiltonians in Section 9.5. 

With knowledge of the time evolution of states we can determine how the mea- 
surable quantities such as probabilities of the measurement outcomes, mean values 
of particular observables, etc., change in the course of time. For instance, suppose 
that AW; € J; for all t from some interval J ; then 


(A)w, = Te(Aet#t-y,efHt-9) 


holds for all s,t € J. Under strengthened assumptions, this relations also admits a 
differential expression. 


9.1.2 Theorem: Let W; describe a state of a conservative system with the Hamil- 
tonian H for all t of some open interval J C IR. The mean value of an observable 
A in state W; satisfies the equation 


© (Aw, = Tr (lH, AW) 
on the interval J provided 
(i) the operators AW,, HAW, , and AHW, belong to the trace class for t € J, 
(ii) the map t+ ||AU(t)¢|| is bounded for ¢ € RanW, if s,t € J, 


(iii) the sum expressing the right side converges in J uniformly with respect to t. 


9.1.3 Remarks: (a) Assumption (iii) is fulfilled, e.g., if dim RanW, < oo for some 
t € J (the dimension obviously does not change with time). This is true, in 
particular, for any pure state 7%, ; assumption (i) then reads y € D([H, A]) for 
all t € J. Assumption (ii) is fulfilled automatically for a bounded observable, 
in which case the derivative on the left side is easily seen to be continuous. 
Finally, the operator i{H, A] is symmetric; if it is also e.s.a., we can rewrite 
the relation as d 


a (Am = (HLA), - 


(b) An advantage of the differential expression is again that it extends to non- 
conservative systems. Formally we get the same relation for them with H 
replaced by H(t) ; to give it a rigorous meaning one has to know more about 
the operator—valued function H(-). 


Proof of Theorem 2: Using the orthonormal basis {¢%} which consists of the eigen- 
vectors of W, corresponding to the eigenvalues w; we can write 

d d 
dt (A)w, = dt » w;(U (t—s)$;, AU (t—s)§) , 


j 
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where s is any point of the interval J. For each term of the series we have 


- (U(t—s)$3, AU(t—s)$%) 


- Hm | — és, Avs) + (405, aS a) | 


6-0 


where ¢; := U(t—s)d§ ; in the second term we have used the fact that ¢{ € D(A) 
due to (i). The first term gives 


; | ( t Av(6)65) — i(Hd5, Ag5) 


= (4 +iH) fs Au(6)é) 


(Ct) 


+ |(H¢%, A(U(6)-1)45)| 


< ||AU(6)95 | + ||AH95ll (76) —-1)¢5 ll — 0 


as 6 — 0 in view of (i),(ii), and postulate (Q4b). The sought derivative is therefore 
—2Im(H¢§, Ag5) = (¢§,7[H, A]¢§) and the assumption (iii) allows us to differenti- 
ate the series term by term. [§ 


As an illustration, let us mention the known fact that in a quantum mechanical 
system of N particles interacting through a potential the mean values of position 
coordinates satisfy the classical equations of motion. Consider a real differentiable 
function V : IR® > R and denote F; := —0V/0x;. Let Q;, Px be the opera- 
tors of Cartesian position and momentum coordinates on L?(JR”) introduced in 
Example 7.2.3; then we define 


Sa ae 
ee = 2 Ta P3+V(Q); (9.2) 
the operators V := V(Q) and F; := F;(Q) are the corresponding functions of the 
family of compatible observables Q := {Q1,...,Qn}. 


9.1.4 Corollary (Ehrenfest theorem): Suppose that SUR”) is a core for H and 
V € C?(R") leaves it invariant. Assume further that %; := e Hts), belongs 
to S(JR") for t from an open interval J C R, and t + max; {||Q;v||, || Pydel|} is 
bounded in J. Then the functions t + (Q;)y, are twice differentiable and satisfy 
the equations 


m= iy, = (Fly, - 


Proof: Since V maps S(JR”) into itself, the identity Fj = VO; —0,;(V%) , where 
0; := 0/Ox;, implies that the same is true for F; ; hence SUR”) C D(F;) follows 
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from the functional calculus. We have Py = —i0;~ for any ~ € SUR”), and 
therefore i[H, Py = Fy and i[H,Q;]~ = m;*Pj. Due to the assumption, 
w € D([H,Q,])N D({H, Pj]) and condition (ii) of Theorem 2 is valid. It yields the 
relations d 


d x 
a (Py, = (Fi), >t (Qi) y, = m;" (Ps) y, ; 
combining them, we get the result. 


The assumptions we have used can be certainly weakened, but since the result 
serves only an illustrative purpose here, we refrain from doing so. The assertion 
provides a motivation for postulate (Q4b): it is substantial for the derivation of the 
“correct” classical relations that we have chosen the operator (9.2) for H which 
appears in Theorem 2. 

In conclusion, let us mention some quantities which play a distinguished role in 
time evolution. Consider again a conservative system with the Hamiltonian H. A 
state W; is said to be stationary if it commutes with the Hamiltonian, W.H C HW, 
for all appropriate t. In particular, a pure state is stationary iff y; is an eigenvector 
of H. Due to the Stone theorem, W; = e~*#(*-s)W,e*#(-s) = W, holds for any 
s,t if W, is stationary. Similarly a pure stationary state, Hy, = Aw,, satisfies 
yy, = e (5), , so the vectors x; belong to the same ray. With this fact in mind 
we usually do not indicate the time dependence of stationary states. 

The mean value of any observable in a stationary state obviously does not depend 
on time. On the other hand, if an observable A is such that the mean values 
(E4(A))w, are time-independent for any realizable state W, of the given system 
and all A € B we call it an integral of motion (or a conserved quantity); the 
definition applies to nonconservative systems as well. If A is an integral of motion, 
the probability measure w(-, A; W,) does not depend on time, so (f(A))w, is time- 
independent for any Borel function f if only it makes sense; this applies particularly 
to (A)w,. For conservative systems we have a simple criterion. 


9.1.5 Proposition: (a) The observable A is an integral of motion iff the operator 
A commutes with the Hamiltonian. 


(b) The total energy of a conservative system is an integral of motion, as is any 
superselection operator. 


Proof: The sufficient condition in (a) verifies easily. Conversely, if A is an integral 
of motion, the identity 


(p, Ut) E4(A)U(—t)p) = (p, Ea(A)d) 


is valid for all t ¢ IR and A € B if w belongs to some coherent subspace H,. The 
operators H, A describe observables, so they are reduced by all Ey ; the same is 
then true for U(t). Mimicking the argument from the proof of Proposition Tiel, 
we infer that E4(A)U(t) = U(t)E,4(A) for all t € IR, A € B, which is equivalent, 
to the commutativity of H and A. Assertion (b) follows easily from (a). [J 


9.2. PICTURES OF MOTION 323 


A family S of observables is called a complete system of integrals of motion if 
any A €S is an integral of motion and no independent ones can be added, i.e., any 
other integral of motion belongs to Si”. A trivial example of such a system consists 
of all Hermitean elements in {H}’; however, we are interested rather in complete 
systems which consist of a few operators only. 


9.1.6 Example: Consider the particle on line of Example 7.2.1, and suppose it is 
free, i.e., H = =P. The parity observable is represented (up to a sign) by the 
reflection operator R: (Rw)(x) = o(—z) ; it obviously has eigenvalues +1 and the 
corresponding eigenspaces H consist of even and odd functions, respectively. The 
subsets Di C H+ of even and odd functions in AC?(JR) are invariant with respect 
to P?, which means that the operator R commutes with H ; in other words, the 
parity is an integral of motion. 

Furthermore, the Hamiltonian can be written as H = H, ®H_, where Hx are 
its parts in the parity eigenspaces. One can check that each of them has a simple 
spectrum (Problem 3); hence the set {H,R} is a CSCO and, at the same time, a 
complete set of integrals of motion for the free particle on line. 


9.2 Pictures of motion 


In quantum theory most predictions are expressed in terms of spectral properties 
of the self-adjoint operators which represent the observables. These properties are 
unitary invariants, and are therefore insensitive to an operation, which replaces all 
operators of the observables by the unitary equivalent operators obtained by means 
of the same unitary operator. It is important in the present context that we may 
use the evolution operator for this purpose. 

The way of describing time evolution discussed in the previous section is called 
the Schrédinger picture. States represented by statistical operators or unit vec- 
tors depend here on time, while operators of observables are time-independent; the 
exceptions are those observables whose time dependence is parametrical, 7.e., coming 
from the environment rather than from the system itself. 

The Heisenberg picture associates with any observable the operator—valued 
function Ay: IR — L..(H) defined by 


An(t) := U(t,s)'AU(t, 8), 


where s is a fixed instant and Ay(s) := A is the operator representing the obser- 
vable in the Schrédinger picture. In common parlance, we again do not distinguish 
between this function and its values, speaking about the observable Ay/(t) , etc. It 
follows from Proposition 9.1.5b that any observable Ay(t) at an arbitrary instant 
t is reduced by all the coherent subspaces. 

Since the predictions of the theory, such as mean values of observables, must 
be independent of the chosen picture, the states are represented in the Heisenberg 
picture by the statistical operators Wy(t) := U(t, s)-!W,U(t, s) or by the vectors 
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w(t) := U(t,s)~'¥. Postulate (Q4a) implies that they are time-independent; we 
write Wi(t) =: W and px(t) =: v. 

If the system is conservative, we easily get the relation between the operators 
representing an observable at different time instants, 


Ag(t) == U(t—7)7'Ag(r) U(t— 17). 


A similar relation is not in general valid for nonconservative systems because the 
propagator need not in this case be a commutative family. As in Proposition 9.1.1, 
the above relation yields a differential form of the equations of motion. 


9.2.1 Proposition: Let A(t) be a bounded observable preserving the domain 
of the Hamiltonian, Ay(s)D(H) C D(H) for some s € JR. Then the function 
Ay(-)¢ is differentiable for any ¢ € D(H) and 


5 Au(t)$ = iH, An(O)]¢. 


It is illustrative to compare this result with Theorem 9.1.2. The norm of the oper- 
ator representing an observable is preserved during time evolution; hence a bounded 
observable remains bounded. For unbounded observables we get formally the same 
equation, but its validity now depends substantially on the relations between the 
domains of H and Ay(t) ; we can, however, study the time evolution of unbounded 
observables using their bounded functions (Problem 4). In the Heisenberg picture 
the integrals of motion are clearly manifested: they are just those observables for 
which the operator—valued function Ay(-) is constant, Ay(t)= A forall te R. 
This applies to nonconservative systems as well. 

The third frequently used picture is the interaction (or Dirac) picture in 
which the time dependence is split between the states and observables. It is used for 
systems with a Hamiltonian of the form H = Ho+ V , where Ho, V are self-adjoint 
operators whose sum is e.s.a.; if the system is nonconservative the decomposition 
is usually chosen so that Ho is time-independent, H(t) = Ho+ V(t). The interac- 
tion picture is obtained from the Schrédinger picture by the unitary transformation 
Wy@)mare*7 | a.e., 


Wo(t) 
wot) : 


where s is again a fixed instant. A Schrédinger observable A is represented in the 
interaction picture by an operator—valued function 


i 


Up(t — s)~'W,Up(t — s) = Uo(s — t)U(t, s)W.U(s, t)Uo(t — s) , 
Uo(t — s)~'v = Uo(s —t)U(t, 8), , 


Ap(t) := Uo(t — s) "A Uo(t — s) ; 


hence the observables carry the part of the time dependence connected with the 
term Hp in the Hamiltonian. In the same way as above, this last relation can be 
easily rephrased in a differential form. 
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9.2.2 Proposition: Let Ap(t) be a bounded observable, Ap(s)D(Ho) C D(Ao) ; 
then the function Ap(-)@ is differentiable for any ¢ € D(Hp) and 


£ A(t) = i[Ho, Ap(t)|¢. 


It is slightly more complicated to express the time dependence of states in a 
differential form. Consider the simplest situation when V is a time-independent 
Hermitean operator; then if W,D(Ho) C D(Ho) or w, € D(Ho) , we have 


i =< Wo(t)d = Volt), Wo(t)I6 


for any ¢ € D(Hpo), and 


iF unlt) = Voltdol®), (9.3) 


where Vp(t) := Uo(t—s)~'VUo(t—s) is the interaction—-picture form of V (Prob- 
lem 5). These equations are also formally satisfied if V is unbounded and time- 
dependent; however, their actual validity has to be checked again separately in any 
particular case. 

Each of the described pictures has its advantages. In quantum mechanics, where 
we typically have a few important observables, the Schrédinger picture is mostly pre- 
ferred. On the other hand, the Heisenberg picture is commonly used in quantum field 
theory, where we have to treat vast families of observables which include, roughly 
speaking, field operators at every point of the configuration space. 

The interaction picture is particularly useful in situations where we are able to 
solve the equations of motion exactly for a part of the total Hamiltonian. In many 
cases the Hamiltonian decomposes naturally into a part Ho, which corresponds to 
a free motion in some sense, and a part V describing the interaction; a typical 
example is operator (9.2) consisting of kinetic and potential energy parts. The free 
problem is often exactly solvable; the interaction picture then allows us to single 
out the part of the propagator related to the interaction Hamiltonian V (hence the 
name). Moreover, if the interaction is weak in some sense, this part of the problem 
can be solved perturbatively; examples will be given in Section 9.5 and Chapter 15. 

The interaction picture is used in quantum mechanics as well as in computa- 
tional methods of quantum field theory. In the last case, however, it cannot be 
generally justified due to the possible existence of nonequivalent representations of 
the canonical commutation relations mentioned in Section 8.2. In particular, for 
any relativistic quantum field theory nonexistence of the interaction picture follows 
from Haag’s theorem, which is mentioned in the notes to Section 12.3. 


9.3. Two examples 


Now we want to discuss in more detail the time evolution of two simple quantum-— 
mechanical systems. First we are going to consider a system of free particles de- 
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scribed by the Hamiltonian — 

ely = ——P ig 

ra 2m; : 

on the state space L?(JR") of Example 7.2.3; in the case of N real spinless particles 
we have n = 3N and M3k+1 = ™M3k4+2 = 773k+3 fier k= 0, 1,. 2,7! . In the 
same way as in Example 7.5.8 we can check that Ho is self-adjoint and has a 
purely continuous spectrum, o(Ho) = JR ; furthermore, C§° is a core for it. The 
corresponding propagator U(t) := e~*#°* can be written explicitly. 


9.3.1 Theorem: Let Ho be the operator defined above; then the relation 


1/ = n 
(U(t)p)(x) = lim. p00 i oe) 7 im exp (5 Emin) wy) f(y) dy 
j=1 


holds for all » € L?(IR"), where {f,} C L?(R") is an arbitrary sequence such 
that |f,(x)| < 1 and limgsoo fe(v) = 1 for a.a. x € IR”. Moreover, we have 
U(t)SUR") c SUR”) - each te R. 

Proof: Choosing yj(xz) := /2m;2z; in Example 3.3.2 we see that Ho is unitarily 
equivalent to P?; — it is sufficient to consider the case with m; = 1/2 only. 
Furthermore, it is eee to prove the formula without regularization because U(t) 
is a bounded operator and wf, — yw. Consider the functions 


Ue 2 Ur = 7 KP (t-te) e>0, 


and u := Uo ; it follows from the functional-calculus rules that U(t) = u(P) = 
s-limeozue(P). The operator on the right side can be expressed using Prob- 
lem 7.17b, and a simple integration yields 


(ue(P)O)(z) = (Ani(e— ie)? fl elle-oP late) dy 


for any € > 0. Moreover, Example 1.2.1 tells us that (ue,(P)w)(x) > (u(P)¢)(zx) 
holds for any w € L?(R") ae. in IR” as k 4 00; if wy € L? NL! the dominated- 
convergence theorem allows us to interchange the limit with the integral, so we 
obtain the sought result, 


(U(e)v)(x) = (Amity? ff eH Mey y) dy 


Finally, the last assertion follows from the fact that S(JR") is preserved by the 
Fourier-Plancherel operator as well as by multiplication by e~#!'?. gj 


In view of Problem 2a, no state of the free-particle system is stationary. The 
explicit form of the propagator leads to a stronger conclusion. 


9.3.2 Example (spreading of minimum-uncertainty states): Suppose that n = 1 
and m, = m, and consider the state (8.2) with some g,p € IR. The vector 
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wt := U(t)p can be obtained by a straightforward computation (Problem 7a); in 
particular, the probability density of finding the particle at a point x equals 


_ 1 l pt “ 
ee(x)|? = von(Aays ep — sa (x-q- * ’ 


where : : 
2 Ap 
-— 2 ” 2 : 
(Aq)s := (Aq)* + (soe =) (Aq)* + ( ) ‘ 


the last identity holds since w is a minimum-uncertainty state, AgAp = < The 
momentum is an integral of motion, so its mean-square deviation is preserved, 
(AP)y, = (AP)y := Ap. On the other hand, the mean-square deviation of the po- 
sition grows quadratically with time; hence 7; ceases to be a minimum—uncertainty 
state immediately, t.e., for any t > 0. 


The conclusions of the example are valid independently of the initial state and 
the number of degrees of freedom involved. 


9.3.3 Proposition: Let a state of the free—particle system satisfy ~, € SUR”) at 
some instant s ; then 


(Qs) = (Q5)y, + uj(t—8), 


(AQ;)%, = (AQs)y, + aj(¢— 8) + bjt — 8)? 
holds for all 7 = 1,...,n and t > 8, where 


1 
U; mj (Pi)y 


[(PiQ3 + Q5Pi)ve — 2Qs)u.(P5)o] | 


1 
a s — 
j ™; 


1 
b; = a (APs: 


9.3.4 Remark: By an approximation argument, validity of the relations extends 
to all states for which the mean values make sense (Problem 7b). The vector v := 
(v1,.--,Un) is called the group velocity of the wave packet. The above relations show, 
in particular, that the identity (PQ + QP)y = 2(Q)y(P)y holds for minimum-— 
uncertainty states. 


Proof of Proposition 3: Put s=0 and w, =: ~. As in Theorem 1 we may consider 
the case m; = 1/2 only, and furthermore, we know from there that %, € SUR”) for 
any t > 0. We shall again employ the relation U(t) = F7!T,F, with u:= el’ ; 
denoting for simplicity u; := Ou/Ozx; etc., we get 


P,T,Fyth = —iuj(Q)Fab + u(Q)P)Fad = u(Q)(P; — 26Q;) Fab. 
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Now Fy!P;F, = —Q; and Fy!u(Q)F, = U(t), so the last relation yields the 
identity Q,;U(t)p = U(t)(Q; + 2tP;)~ ; from here the first assertion follows easily. 
To get the standard deviation expression, we similarly use 


PT Fa) = —uj;(Q)Fnb — 2iuj(Q)P)Fry + u(Q) P? Fa 

u(Q)(4t?Q? — 2it — 4tQ;P; + P?) Fav) 

u(Q)(4t7Q5 — 2t(P;Q; + Q;P;) + Pj) Fn, 

where the canonical commutation relations are involved in the last step (cf. Re- 
mark 8.2.8). We now have to apply F7! to both sides of this equation to get an 


expression for (Q7)y, ; combining it with the first relation we arrive at the mean- 
square deviation formula. 


Physically the most important consequence of Proposition 3 is that in a free- 
particle system localization is not preserved: since the momentum operators have 
purely continuous spectra, and therefore (AP;)y is nonzero for any state of the 
system due to Proposition 8.1.1, 


jim (AQi) vs a 


regardless of the precision with which we determine the coordinates at the initial 
instant. Moreover, the speed of the wave packet spreading is not limited (Prob- 
lem 8); this is connected with the nonrelativistic character of the system under 
consideration. 

Another example that we are going to discuss here concerns the linear harmonic 
oscillator of Example 7.2.4, whose Hamiltonian is 


1 a 
H = — P? + -mw*Q?’. 
2m 2 
Since it has a pure point spectrum, the action of the corresponding propagator can 
be written down easily. 


9.3.5 Example (propagation of minimum-uncertainty states): For simplicity, put 
again m=! = w = 2. The vectors a, of Remark 2.2.8 can be expanded in 
the orthonormal basis {~,} of the eigenvectors of H. Using the unitary oper- 
ator V defined there, together with Problem 8.6b, we easily find ¢y,) = Yy = 


eT WE? Sg O(n!) bn, where w:= 2-"?(q—ip). Then 


~iHty, |wi?/2 w" -it(an41) 
e =e —=e" 

| n=0 vn ms 

and the right side is equal to ed, ea . Since the function ~,, is explicitly known 
we can compute e~*¥*y),, ; in particular, we get 


Pay: 2 1 
\(e Hy, 9)(z)| ~ 5 ce 
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so the wave packet does not change its form and follows the classical oscillator trajec- 
tory with initial position g and momentum p. The simplicity and elegance of this 
argument is manifested when we compare it with the straightforward computation 
used in Example 8 below. 


In this case also the propagator has an explicit integral-operator expression. 


9.3.6 Theorem: Let ¢ # ™, where n is an integer; then the propagator of the 
harmonic oscillator is given by 


(U(W)(e) = Lim. proof Ke(a,y) blu) fel) dy 
with 


ie a ae (oe Pci orn = 2 =|} 
a 271| sin we| a ra me 2 Mens 


for all x € L?(JR), where [-] means the entire part and the regularizing sequence 
{fe} has the same properties as in Theorem 1. 


Proof is left to the reader (Problem 9). 


9.3.7 Remark: For multiples of the oscillator half-period, t = na/w, the kernel 
expression is meaningless but the evolution operator still exists and is proportional 
to a power of the reflection operator, U(na/w) = e~""/?R”. 


Unlike the free—particle case, the harmonic oscillator exhibits no spreading. The 
form of the wave packets can change in time; however, the motion is periodic for 
any initial state because U(t + 2mn/w) = (—1)"U(t) for any te R. 


9.3.8 Example (minimum-uncertainty states revisited): Applying the above ex- 
plicit form of the propagator to the general minimum -uncertainty state w of Ex- 
ample 8.1.7, by a straightforward computation (Problem 10) we get 


(Ap)t | 2  tAg ee 243 ee 
x exp {- by (ea 202 ae sin wt q (Ap) 9 QP / ; 


YAN : ; 
Aq coswt + = sinwt, (Ap): := Ap coswt + imw Ag sinuwt, 


_ _ tg _ PAg — igAp | 
?~ 2(Bq? Aq 


> 
= 
i 


in the last expression we used AgAp = 3. The relations simplify considerably if we 
put Ag = Ap/mw = (2mw)~1/? ; in this case the vector ~ with q = p = 0 describes 
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the harmonic-oscillator ground state and we get, in particular, the expression for 
probability density, 


ebe(a) |? = 4 [— e~mlz—¢ cos wt — (p/mw) sin wt]? , 


Hence we recover the result obtained in Example 5 for m7! = w = 2: the wave 
packets of coherent states preserve their shape and move along the classical oscillator 
trajectories. Let us stress, however, that this is true only for states obtained by 
application of the Weyl operator to the ground state. To make this clear, choose w 
with q=p=0 and Aq # (2mw)-? ; then 


1 2 2 
w z) 2 = ——— a /2(q); 
lv+(x)| Aa 

with 
2 2 2 Ap ae 2 

(Aq); := (Ag)* cos* wt + (—*) sin* wt , 


i.e., the wave packet is “breathing”, having the Gaussian form of a periodically 
changing width. 


9.4 The Feynman integral 


Consider again the system of particles with a potential interaction which is described 
by the Hamiltonian (9.2). For the present moment we are not going to specify the 
assumptions concerning V ; we suppose that H is e.s.a. leaving to Chapter 14a 
discussion of the problem for which classes of potentials this is true. = 

Our aim is to derive a useful expression of the propagator U(t) = e~*#*. We 
employ the fact that H is a sum of two operators, H = Ho+V. The first of 
these is the free Hamiltonian discussed in the previous section, whose propagator 
is explicitly known; the other is a multiplication operator so that e~*”* is found 
trivially. With knowledge of the two operator families, we can express the sought 
propagator by means of Theorem 5.9.7, 


_ —tHot/N ,-iVt/N \N 

U(t) g lim (e e ) ; 

Hence U(t)y = limyieo yl) for any ~ € L*(IR"), where the approximating se- 
quence is defined recursively by y) =~ and 


N i 4 - 
[NY sae UHMHIN ie NRO... . 


We choose, for instance, the characteristic functions of the balls B; := {x € IR”: 
|z| < 7} as the regularization sequence {f;} in Theorem 9.3.1; then by induction 
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we get 
n . \N/2 
(N)(p) — (==) 
x)= 9.4 
Hc) = TT (sae) (9.4) 
Ovo EFS (y rest) ay) dy... dyN—D 


By, X...XByx 
for a.a. x € IR® (Problem 11), where we have denoted y%) := 2, 6y :=t/N and 
(0) (W) LS mH [HD _ () 
Sw (y,...0y it) = D 35, — yl — Vy) by | 
= j= 


It is easy to see that we can use any other regularization procedure with the proper- 
ties specified in Theorem 9.3.1. Hence we arrive at the following conclusion. 


9.4.1 Theorem: If the operator (9.2) is e.s.a., then the corresponding propagator 
is given by (9.4) for any = € L?(JR"). 


What is the meaning of this result? Consider the system of classical particles 
whose dynamics is determined by the Hamilton function 


nm 


haq.p) = = +V(q); 


j=1 


where as usual q := (q1,...,@n) and p := (pi,...,~n). For any trajectory y : 
[(0,t] + JR” of this system we can define the action 


so) = [ (5 Yo msis(e)? vi) ds 


In particular, if y(-) = y(y,...,y;-) is a piecewise linear path whose graph is 
the polygonal line with vertices at y") = y(kt/N), k=0,1,...,N, then we have 


N=1 

k+1 a ae 

a Hy, ...,9%; 8? dos So [yr —y®/ oy, 
k=0 


i.e., the first terms in the expressions of Sy (y, ey );t) and Sy(y) coincide. 

This is not true for the second terms but their difference is in most cases small 
for large N (Problem 12). This suggests that the limit in (9.4) might not change 
if we replace Sy (y, ey );t) by S(7) ; in other words, we conjecture 


" of 


ePy it) af (yO) dy re dyN~}) 


Bs, XX Bz ny 


(ewe) = Jim. TI (gris 
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for a.a. x € IR". It appears that this relation is indeed valid for wide classes of 
potentials; however, the corresponding proofs are not easy with exception of the 
simplest cases (cf. Problem 13 and the notes). 

Hence (U(t)~)(x) is a.e. approximated by expressions, which may be inter- 
preted as the integral of ++ eu(7(0)) over the set of the polygonal paths that 
are linear in the intervals (kéy,(k + 1)6n), k =0,1,..., NM —1. Since any contin- 
uous trajectory can be approximated by such paths, it is natural to ask whether 
(U(t)~)(x) could be calculated directly, without the limiting procedure, as an inte- 
gral over the set of all continuous paths ending at point z, 


(Uw) (a) = ff o((0)) Dy. 
o(t)=z 

This idea belongs to R. Feynman, after whom these expressions are usually named. 

It should be stressed that the object on the right side is only formal, in particular, 

because the Lebesgue measure has no counterpart in the path space in view of its 

infinite dimension (Problem 14). 

Before we discuss the meaning of the right side in the last formula, we should 
briefly mention its appealing properties. The Feynman expression of the propagator 
makes the relations between quantum and classical mechanics very illustrative. In 
the standard system of units, the formula reads 


(UUYe) = [eSB (4(0)) Dy. (9.6) 


If the Planck constant h is sufficiently small compared with the action involved, the 
integral can be treated formally by the stationary-phase method. In this way, we can 
conjecture that the contributions to the integral cancel with the exception of those 
coming from the vicinity of stationary points y. of the function S. It is known from 
classical mechanics, however, that the action is stationary just for the paths that 
solve the equations of motion. This means that only trajectories near to the classical 
ones contribute significantly to the value of the Feynman integral. Moreover, the 
formal stationary-phase evaluation of the integral yields the well-known semiclas- 
sical approximation to wave function, with the leading term containing the factor 
e(/F)S(7ct) | This heuristic argument admits a rigorous formulation; we comment on 
it in the notes. 

Let us now return to the problem of the meaning of Feynman’s integral. It has to 
be mentioned first that a functional integral similar to (9.6) was studied, long before 
Feynman’s papers, in connection with the mathematical theory of Brownian motion. 
To describe it briefly, we select a particular class of functions on the path space I, ; 
the latter is chosen as the family of all continuous functions y : [0,t] + JR" such 
that y(t) =x. The space I’, as a subset of C((0, t], IR") is naturally equipped with 
the norm-induced topology || - ||. , and this topology in turn determines the system 
B, of Borel sets. A function f : [, + € is called cylindrical if f(y) depends only 
on the values of the function y at a finite sequence of points {omge: O =arigecerpe< 
... <™ =t}; if this is the case we write f(y) =: f(y(7),...,7(tw-1)). Denote 
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Ok ‘= Tk41—T ; then for each o > 0 there is just one Borel measure we on I’, such 
that the relation 


N-1 
J Flr), --+51(r04)) devel) = TT (@ni6x)-*? 
= k=0 
(9.7) 
1S (k+1)__ . (k))25—1 (0) (N-1)) 4, (0) (N-1) 
pleas op {= 3 Shy = On \ iy yoy”? idan” = « «digg 


holds for any Borel function which is cylindrical and bounded (see the notes for 
references to the proof). The measure w, is called the Wiener measure and the 
integral with respect to it is the Wiener integral. 

There are many similarities between Feynman and Wiener integrals. To il- 
lustrate this we put for simplicity m; = m, 7 = 1,...,n, and denote f(y) := 
exp (-i {6 ViV(s)) ds) w(y(0)) ; then the right side in the expression of (U(t)#)(zx) 
can be formally written as 


[ t) o (= ft@Pas) Dv. 


Comparing the Wiener integral (9.7) of a cylindrical function to (9.5) we see that it 
has the same formal expression with m = i/o. In this case, loosely speaking, the 
singularities of the exponential term and of Dy cancel mutually and we can replace 
exp (— 35 Jo l¥(s)|?ds ) Dy by dwo(y). 

nfortunately, this argument depends substantially on the fact that the expres- 
sion in the exponent is real and nonpositive, as the following result shows (see the 
notes for references to the proof): 


9.4.2 Theorem (Cameron): Let o be a nonzero complex number, Reo > 0. A 
finite complex measure w, such that the relation (9.7) holds for any Borel function 
f :T,—(, which is cylindrical and bounded, exists iff o € (0,00). 


It follows that the Feynman integral cannot be interpreted within the standard 
theory of integration. Hence if we want to use its advantages suggested by heuristic 
arguments, which we have indicated briefly above, it must be defined in another 
way. One possibility is to employ the Trotter formula as we did to derive the 
relation (9.4); the resulting expression is often called the product F-integral. Some 
other possibilities are reviewed in the notes. 

The Feynman idea in turn affected the theory of the Wiener integral: it inspired 
the so-called Feynman—Kac formula , which has found many applications in 
various parts of mathematical physics. We shall formulate it for n = 3 and the 
potential V € L? + L™, i.e., expressible as the sum of a pair of functions which 
belong to the two classes. The corresponding operator H = =P? +V is then 
self-adjoint by Theorem 14.1.2 and 


(ey) (x) = fC 6 M4 (4(0)) dara) (9.8) 
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holds for all 7 € L?(UR3) and a.a. x € JR®. Similar assertions are valid under much 
more general circumstances — see the notes. 


9.5 Nonconservative systems 


If the Hamiltonian is time-dependent, it is generally not easy to find a solution to 
equations of motion. We are now going to describe some methods which can be used 
to this end. The simplest among them is based on expanding the sought solution 
into a series. 


9.5.1 Theorem (Dyson expansion): Let H : IR > B(H) be a strongly continuous 
Hermitean—valued function, and set 


te = 6 + DUnlt,s)¢, (9.9) 
n=1 
with - 
ieee raf at, | dty .. ar dtp H(t)... H(tn)d 


for any ¢ € H. Then the series >>, U,,(t,s) converges with respect to the operator 
norm, U(t,s)¢ := y% defines a unitary propagator and the vector—valued function 
tr x, solves the Schrédinger equation 


tv = A (t)p 


with the initial condition 2), = ¢. 


9.5.2 Remark: If the function H(-) is operator-norm continuous, the existence of 
the integral 


Ue (=: yn fats f —_ ie dtp H(t;)... H(te) 


is easily established. In addition, if the family { H(t): t € IR} is commutative, the 
multiple integral can be simplified giving the relation U(t,s) = exp (-1 fi H(r) dr) , 
which generalizes the expression for propagators of conservative systems. Motivated 
by this, (9.9) is sometimes written symbolically as 


th = Te ff A(r)ar 


and called the time-ordered exponential of the Hamiltonian, the name referring to 
the ordered arguments in the above integrals. 


Proof of Theorem 1: Let us first check the existence and continuity of Up(-,s)¢ for 
any s € JR and ge H. If n = 1, this follows from the continuity of H(-) and 
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the absolute continuity of the integral; in the general case the result is obtained by 
induction from the recursive relation 


Unialt,s) = -i f “HNO aaloane 


Denote Kr := {[s,t] : s?+t? < T?} ; then |t—s| < 2T holds for each pair 
[s,t] € Kr and Js C [—T,T] for any closed interval J.¢ with endpoints s,t. By 
assumption, ||H(-)d|| is continuous and therefore bounded on [—T,T], in which 


case the uniform boundedness principle implies the existence of a positive Cy such 
that ||H(t)|| < Cr for all t € [-T,T]. Hence we have 


\Un(t, s)ol| < CRlldl lt aL egitlen: 


lg], n=1,2,..., 


so the operators U,,(t,s) are bounded. Denoting U‘%)(t,s) := I + DN_, Un(t,s), 
we deduce from the above estimate that the sequence {U‘%)(t,s)} converges with 
respect to the operator norm to some U(t,s) € B(H) uniformly on any compact 
subset of IR?. The relation (9.9) can then be written as y= U(t, s)d. 

To show that the operator U(t,s) has the needed properties, first note that 
Hermiticity in combination with a simple transformation of the integration domain 
yields (~,U,(t,s)*?) = (¢,Un(t, s))) = (Y,Un(s,t)b) for any ¢,y € H (Prob- 
lem 15), so Un(t,s)* = U,(s,t) ; then U(t,s)* = U(s,t) follows for all s,t € R 
from the operator—norm continuity of the adjoint operation. Next we shall check 
that the functions U,,(-,-) are operator-norm continuous. The just proved property 
implies 


\|Un(t, 8) — Un(to, $0)|| < ||Un(t, 8) — Un(to, s)l] + l|Un(s, to.) — Un(s0; to)|I - 


The unit ball B,([so,to]) is contained in Kr for T := 1+ \/s§+ #3, so using the 
recursive expression of U,,(t,s) we get 


|Un(t,s) ~ Untto,s) < sup (sen(t-to) f° ILH(4)Uma(ts,s)¢ll dt ) 
l}ol)=1 to 


(2TCr)"* 
ach 


IA 


Cr |t=t 


for (s,t) € B,({so,to]), and a similar estimate with |t—to| replaced by |s—so| in 
the other term; hence continuity follows. Since the sequence {U‘%)(t, s)} converges 
uniformly in compact sets, we conclude that (s,t) ++ U(t,s) is operator-norm 
continuous and, a fortiori, strongly continuous. 

To prove U(t,s)U(s,r) = U(t,r) for all r,s,t € R, it is sufficient to verify that 
the sequence of operators 


N 


UM) (t,s)U%(s,r) = (Exes oe x, | SU (GS) Un-g(Sy) , 


n=0j=0 n=N+4+1j=n—N 


336 CHAPTER 9. TIME EVOLUTION 


where we have set Uo(t,s) := I, converges with respect to the operator norm to 
U(t,r) as N — oo. The norm of the second part of the sum is bounded above by 
ee va1(4TCr)"/n!, which converges to zero, so the sought relation follows from 
the identity 


Bs U;(t, 8)Un4(s,7) = Un(t,r) , 


which is easily proved by induction (Problem 15). Combining it with the above 
results we see that { U(t,s): s,t € JR} is a unitary propagator. 

The last thing to prove is that the vector-valued function U(-,s)¢ satisfies the 
Schrédinger equation. The recursive expression of U,,(t,s) gives 


LUM (,9)¢ = ~iH()UO(t, 8)¢ 

for any ¢ € H. The right side converges to —iH(t)y as N — oo ; hence the left 
side is also convergent and we only have to show that its limit equals ¢ yw. In the 
same way as above we get 


" N 
| ; p> (U,(t+h, s) a Un(t, 8) )¢g % » g Ualts)0 | 
n=0 


n=0 


— 


1 C2 So — Uae d 
e ( Peet el £ Un(t.s)0} | 


n=0 


oo (2TCr)™" 
+Crlél Gar 


provided T > 14+ Vs?+t? and |h| < 1 (Problem 15). For any € > 0 there are 
No, R such that the last term on the right side is < 5 for N > No, and to such 
a number N there is 6 > 0 such that the first term is < 5 if 0 < |h| <6. Hence 
the norm on the left side is < € for all N < No and 0 < {h| < 6, and the proof is 


concluded by taking the limit Noo. ff 


In most practically interesting situations, however, the Hamiltonians are un- 
bounded, so Theorem 1 is not directly applicable. The difficulty can often be 
bypassed by using the interaction picture. Assume, for instance, that H(t) := 
Ho + V(t), where Ho is self-adjoint and V(-) is a strongly continuous Hermitean— 
valued function, so H(t) is self-adjoint and D(H(t)) = D(Ho) for any te R. 
The function Vp : Vp(t) = Uo(t—s)~! V(t) Up(t—s) is again Hermitean-valued and 
strongly continuous, so the equation (9.3) has the solution wp(t) := Up(t,s)y, for 
any Ws, € H, which is given by (9.9) with H(t;) replaced by Vp(t;). Defining 


W, := Up(t—s)Up(t, s)v., 


we see that the function t + ¢%, formally obeys the Schrédinger equation with the 
Hamiltonian H(t). To prove that this is indeed a solution we have to check that 
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Up(t,s) maps D(Hpo) into itself, which may not be true if the operators Ho, V(t) 
do not commute. 

A more powerful method of establishing the existence of solutions to a time— 
dependent Schrédinger equation is obtained if the propagator is approximated by a 
suitable sequence of products of unitary operators. We have 


U(t,s) = U(t,t—6,)U(t—6n, t—26,) ... U(s+6n, 8) 


for any positive integer n where 6, := (t—s)/n ; the approximation consists of 
replacing the function H(-) in the intervals (s+(k—1)6n,s+k6,) by its value at 
some point, say, ™ := s+ké6,. This gives 


Unt, 8) — et (en tH (Tra )bn cae eA (t1)6n 


and one expects these expressions to approximate the propagator, 
Oso slim U,(t,8)e (9.10) 


provided the function H(-) is smooth enough. We shall present one result of this 
type without proof; more details can be found in the notes. 


9.5.3 Theorem: Suppose the Hamiltonian is of the form H(t) = Ho + V(t) for t 
belonging to an open interval J C AR, with Ho, V(t) self-adjoint, and denote 
C(t, 8) := (H(t)—7)(H(s)—i)—!— I. In addition, assume that 


(i) V(t) is Ho—bounded with the relative bound <1 for any te J, 


(ii) the function (s,t) ++ (t—s)~1C(t, s)@ is bounded and uniformly continuous for 
any ¢€H if s,t belong to a compact subinterval of J and s ¥t, 


(iii) the limit C(t)d := lim,2.(t—s)~'C(t, 8) exists for any ¢ € H, uniformly in 
each compact subinterval, and the function C(-) is strongly continuous and 
bounded. 


Then for any s,t € J, s < t, the convergence in (9.10) is uniform in each com- 
pact subinterval and the relation defines U(t,s) such that y := U(t,s)¢ for any 
¢@ € D(Ho) solves the Schrédinger equation with the Hamiltonian H(t) and initial 
condition ~, = ¢. 


One of the consequences is that Dyson’s expansion is applicable in the interaction 
picture provided the function V(-) is strongly differentiable. 


9.5.4 Corollary: Let H(t) := Hot+V(t) with Ho self-adjoint and V(t) Hermitean 
for any t from an open interval J Cc R. If £V(t)d =: G(t)¢@ exists for all 
te J, ¢€H, and the function G is strongly continuous in J ; then the conclusions 
of Theorem 3 are valid. 
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Proof: The operators V(t) are bounded, so it is sufficient to check the validity of 
the assumptions (ii), (iii). We define the function F by 


(t—s)~!C(t, s) ence, OL 
F(s,t) = 
G@Q(HO=1)" «as SH 


Since ||(H(s)—7)'|| < 1, its values are bounded operators; in view of the second 
resolvent identity, ||(H(-)—7)~1|| is strongly continuous. The sequentiai continuity 
of operator multiplication then implies that F'(-,-) is continuous for s # t, and 
due to the assumption, it is also continuous in the “diagonal” points of J x J. 
This means, in particular, that ||F(-,-)@|| is for any ¢ € H and a compact interval 
Jo C J continuous in Jo x Jo, and is therefore uniformly continuous in this set; the 
validity of (ii), (iii) follows easily from here. 


In fact, Theorem 3 allows us to establish the existence of the propagator in even 
more general situations when the interaction term is unbounded — see the notes. 
Let us now briefly mention an application of Dyson’s expansion. 


9.5.5 Example: Suppose the system, which at the initial instant t = 0 is in a state 
described by an eigenvector ¢; of Ho, is exposed to a time-dependent interaction 
V(t) during (0,7) ; we are interested in the corresponding transition probability 
to another eigenstate ¢, of the unperturbed Hamiltonian. If the assumptions of 
Corollary 4 are valid, we get 


(bu, U(t,0)¢3) = yD (-ar fi aty fat 


x (dx, Uo(t—ti)V (t1)U0(ti te) V (te)... V (tn) Uo(tn)o;) 


since the zeroth term is absent for k #7. In the first approximation, the transition 
amplitude equals 


(Gr, U(t,0)9;) = -ie [ * eiirrs)s (dx, V(s);) ds + R2(V,t), 


where A;, Ax are the respective eigenvalues and the remainder term can be estimated 
as in the proof of Theorem 1, |R2(V,t)| < e@'— 1— Ct provided ||V(t)|| < C for 
t € (0,T), so it is of order O(t?) for small ¢t. In a similar way, we can derive 
higher approximations. It is natural to expect that the transition probability will 
be small for V(t) which is only slowly changing. If we replace V by V; := V(t/r) 
then the leading-order contribution to the transition probability is easily seen to be 
< (Ct/r)?, so it vanishes for 7 > oo. This is a particular case of a much more 
general result — see the notes. 


It is usually much more difficult than in the conservative case to derive the 
propagator of a time-dependent system in a closed form. Sometimes one can find 
an inspiration in the solution of the corresponding classical problem. 
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9.5.6 Example: Consider again the harmonic oscillator, this time with time— 
dependent perturbation, 


i ip 
HG) =s— Pt =mw?Q? ; 
(t) oa + 5m Q?+ f(t)Q 
It is known from classical mechanics that the driven-oscillator coordinate and mo- 


mentum trajectories are shifted by —(mw)~!G(t, s) and a(t,s), respectively, where 


ats) = fio cosw(r—t)dr, (t,s) := - fie) sin w(r—t) dr 


(Problem 16a). We found in Example 9.3.5 that coherent-state wave packets follow 
classical trajectories in the unperturbed oscillator. This suggests that we might try 
to “shift” the corresponding propagator by the Weyl operator W(—G/mw,—a) (cf. 
Example 8.2.7), 2.e., to look for the propagator in the form 


U(t,s) = e714") e—talhe) 9 iP) Pimary (4a) , (9.11) 


where U)(-) is the propagator of Theorem 9.3.6 and + is a phase factor to be found. 
Using the corresponding Schrédinger equation we find that, choosing 


(ts) = — == f (alr,s)?— Blr,8)?)dr, 


we do indeed get the sought propagator (Problem 16b). 


Some classes of nonconservative systems are particularly important in applica- 
tions. In Example 5 above we mentioned the case when the Hamiltonian is slowly 
varying. Another important situation occurs when the time dependence is periodic, 
H(t) = H(t+T) for some T > 0 and all t € RA. It then follows from postulate 
(Q4c) that the evolution operator satisfies U(t+T,s+T) = U(t,s) , and therefore 


U(t+nT,s) = U(t,s)(U(s+T, s))” (9.12) 


for any integer n and arbitrary s,t € IR. Hence the most important contribution to 
the evolution of states of time—periodic systems over long time intervals comes from 
powers of the evolution operator over one period, the remaining part being the same 
within each period. The operator U(s+T,s) for a fixed s is called the monodromy 
operator. The dependence on s is unimportant: the periodicity assumption together 
with the definition of the propagator implies that all monodromy operators are 
unitarily equivalent, so they have, in particular, identical spectral properties. Recall 
that as a unitary operator, U(s+T,s) has a spectral decomposition; we can also 
write it as U(s+T,s) =: e'47 (cf. Problem 5.31) in which case the operator A is 
usually called the quasienergy. 

It appears that the evolution of time-periodic systems depends substantially 
on the spectral properties of U(s+T,s). To illustrate this, consider the situation 
when the monodromy operator has an eigenvalue. The corresponding state changes, 
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of course, during the period; however, it follows a closed loop in the state space 
returning to the same ray after each period. In this respect it is similar to a sta- 
tionary state. On the other hand, if the state belongs to the continuous spectral 
subspace of U(s+T,s) then it exhibits no recurrent behavior despite the fact that 
H(-) is periodic. The spectral properties of the quasienergy operator can be rather 
unstable. 


9.5.7 Example: Consider the driven harmonic oscillator of the preceding example 
and choose f(t) := sin Qt with Q > 0. The propagator (9.11) is easily found 
(Problem 16c); in particular, we have 


Q(cos wt — cos Nt Q sin wt — wsin Nt 4 
a(t,0) = ee B(t,0) = — if Aw 
F ist 
a0) ae ae A(t,0) = —<— = ua 


We denote by T := 27/Q the driving force period; then for 2 = w we obtain 
U(T,0) = —e~*(P%e-**P/m™ due to Remark 9.3.7, and therefore the monodromy 
operator can easily be checked to have a purely continuous spectrum, , dac(U(T,0)) = 
{z€C: |z| = 1}, in view of the unitary equivalence (4.4). We have mentioned that 
the spectrum is independent of the initial instant; however, the operators themselves 
differ: we have, e.g., U(5T/4,T/4) = — e~*"T/4:7/4) e-(5T/8)Q 

On the other hand, if the driving frequency is not in resonance with the proper 
frequency of the oscillator, 2 #4 w, the character of the spectrum changes. The 
reader is asked in Problem 16d to check that the point spectrum of U(T,0) is 
nonempty, but in fact a much stronger result is valid (see the notes). 


9.6 Unstable systems 


Most microscopic objects in physics are unstable. We know, for instance, that a 
charged pion will almost surely decay into a muon and a neutrino after 107®s, or 
that an excited atom will eventually radiate one or more photons and pass to the 
ground state as a result of its interaction with the electromagnetic field, etc. In spite 
of the different physical mechanisms which govern decay processes, the latter have 
some common properties; this is the topic of this section. 

The propagator associated with an unstable system should be nonunitary to 
describe the fact that, roughly speaking, the probability of finding the system in 
the original undecayed state decreases with time. To avoid a contradiction with 
postulate (Q4b) the unstable system cannot be regarded as isolated; in other words, 
we have to treat it as a part of a larger system including its decay products. 

Consider an isolated system S with a state space H on which the evolution op- 
erator U acts. If H, is a proper subspace in H which is noninvariant with respect 
to U(t), then U(t) = E,U(t)w+(I—E,)U(t)y holds for ~ € Hy, , where the second 
term on the right side is generally non-zero; in that case we have ||E,,U(t)~|| < |\|\ . 
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A state 7% € H, thus evolves into a superposition of vectors from the subspaces Hy 
and H: ; if we identify the rays in these subspaces with the states of the unstable 
system and its decay products, respectively, we get a natural scheme for description 
of unstable systems in quantum theory: 


(u) the state Hilbert space H,, of an unstable system is a subspace in the state 
space H of a larger isolated system S. Time evolution in H is described by 
the unitary propagator U : U(t) = e~*#*, where H is the Hamiltonian of 
system S. The subspace H,, fails to be invariant with respect to U(t) for 
any t>0. 


The last part is substantial; it expresses the fact that a state vector can leave the 
subspace 7,,. 

As in the previous sections we shall consider pure states only. Time evolution 
of the unstable system alone is determined by the reduced propagator 


V: V(t) = £,0(¢) Meare 


where EF, is the projection onto the subspace H,, C 1. We assume that the state 
of the system at the initial time t= 0 is described by a unit vector ~ € H, ; then 
the decay law is defined by 


P,(t) = |V@¢vl? = |Z. (ey |/?. 


. Its value is therefore interpreted as the probability of finding the system undecayed 
in a measurement performed at the instant ¢, or in other words, of obtaining a 
positive result in the yes—no experiment associated with projection E,,. 


9.6.1 Proposition: The function V(-) is strongly continuous, and the relations 
V(t)* = V(-t) and ||V(¢)|| <1 hold for any t € IR. The reduced propagator does 
not have the group property: there are s,t € JR such that V(s)V(t) 4 V(s+2). 
The decay law P,(-) is for any ~ € Hy, a continuous function which satisfies the 
relations 0 < P,(t) < P,(0) =1. 

Proof is left to the reader (Problem 17b). 


9.6.2 Example: Let H,, be a one-dimensional subspace in spanned by the unit 
vector ~. The reduced propagator acts as a multiplication by u(t) := (~, U(t)w) 
and the decay law is P,(t) = |v(t)|? ; using the spectral decomposition of the 
Hamiltonian, we can rewrite it as 


Palt) = |f edt, EW) 


The vector 7 usually belongs to the absolutely continuous subspace H,,-(H) ; in 
that case the Riemann-Lebesgue lemma implies lim;_,.. Py(t) = 0. For instance, 
if ~ := Wu is such that the corresponding measure is given by the Breit-Wigner 
formula of Example 7.4.6, then the decay law is of the exponential form 


2 
= ett 


2 


—trt 


if e 
Qn Ih (A=Ao)? + IF? a 


Py, (t) = 
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for all t > 0 since a simple integration gives v(t) = e~otTl4/2 

9.6.3 Remark: A typical feature of the exponential decay law is that its initial de- 
cay rate, defined as the one-sided derivative P,(0+) =I, is positive. This is closely 
connected to the fact that yw, is not a finite-energy state — see Problem 18 and 
Example 7.4.6. This does not mean, however, that we can use it to decide whether a 
given unstable state is a finite-energy state or not; like any other measurement of a 
continuous quantity, time can be determined only with a finite resolution, so in rea- 
lity we are not able to measure the derivative of P,(-). Recall the result expressed 
by Proposition 7.4.7; we can again construct the truncated states yy := E(As)v, 
which are experimentally indistinguishable from w~ for b large enough. For any 
€ >0 there is bo such that |/~—w»l| < © for all 6 > bo, and therefore 


|Py(t) — Py(t)| <2] |BuU(e)voll - EWU )oll | < 2B O)(do— ¢)I| < 2e; 


at the same time 7 as a finite-energy state has zero initial decay rate (see also 
Problem 19a). 


The reduced evolution operator corresponding to the exponential! decay law sat- 
isfies the semigroup condition, 


V(s)V(t) = V(s+t), s,t>0, 
Propagators with this property are also used for the description of more complicated 


decays. 


9.6.4 Example (decay of neutral kaons): If we disregard the configuration-space 
degrees of freedom we may associate a two-dimensional space H, ~ €? with a 
neutral kaon. Its basis is formed either by 


bio =(3): oe = (2) 


or by the pair of nonorthogonal vectors 


- 1l+e = l+e 
dm (ite), tou ( 1h), 


where N. := (1 + |e|?)~'/? and € is a complex parameter whose experimentally 
determined value is about (2.3 x 107%)e"/4. The vectors #5, dz represent the 
short~ and long-living component in the weak decay of a kaon-antikaon superposi- 
tion. The operators Ps, P, on H, are represented in the orthonormal basis by the 


matrices 
oo 1 ) ee: a 1 —b, 
Pam 5 NS Pom 3 ( js ale 


where 6, := (1+¢€)/(1—€) ; we can check that they are nonorthogonal projections, 
P; Pe = b5eP_ and Pb, = b;nb% for j,k = S,L. Time evolution in H, is conven- 
tionally described by the reduced propagator 


Vi: V(t) = Pse48* + Py e Mt 
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where z;:= m;— $I; with ms © mz © 498 MeV, I's = 3.7 x 107! MeV, and 
3’; = 0.64 x 1074 MeV. It is easy to see that V satisfies the semigroup condition; 
however, the decay is exponential only if it starts from some of the states os, oz 
(Problem 19b). 


The trouble with the semigroup condition is that it in fact represents a strong 
requirement on the energy spectrum. We have seen already in Example 2 that 
o(H) = R must hold for the Hamiltonian corresponding to the exponential decay 
law; otherwise the integrated function could not be nonzero for all A € R. It 
appears that this is also true in a more general situation (see the notes): 


9.6.5 Theorem: Suppose that the spaces H, C H and the propagator U(t) := 
e~*#t satisfy assumption (u). If the semigroup condition is valid for the reduced 
propagator, then o(H)= R. 


The Hamiltonian is usually a positive operator; in relativistic quantum field 
theories one even postulates its positivity. The above result then means that a 
semigroup reduced evolution cannot satisfy the semigroup condition, in particular, 
that exponential decay laws are excluded. This is no disaster, however, because it 
tells us only that the semigroup condition cannot be ezactly valid; we know from 
Remark 3 and Problem 19a that a decay law with a below bounded Hamiltonian 
can differ so little from the exponential one that the difference is irrelevant from 
the experimentalist’s point of view; the same is also true in a more general con- 
text. Hence the semigroup description of decays is necessarily approximate, but the 
approximation is good enough for every practical purpose. 

To be able to grasp better the nature of this approximation, we introduce the 
reduced resolvent of the Hamiltonian with respect to the subspace H, by 


Ri,(z) = BRu(2) ) Hu 


and use it to express the reduced propagator. Using the spectral decomposition of 
the Hamiltonian we can write 


V(t) = ie eta Fy 


for any w € H,, where Fy = F(—oo, A] := EE! \H.u. Further, using the Stone 
formula, we can express the vector measure appearing in the last relation, 


d2 
5 (FD dl + FO d)} = so slim [” [Ri (E+in) ~ RY (E—in)] de. 
The support of F(-)z is obviously contained in o(H), and the same is true for 
supp F := Uyen, supp F(-)#. Hence, while the resolvent Ry(€) does not exist at a 
point € € suppF by definition, it is not excluded at the same time that the limits 
s-limyo+ R},(€ + in) exist for the reduced resolvent, and the subintegral function 
in the last formula is bounded for (€,7) € [A1, 2] x [—70,70] with some 7 > 0, 
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in which case the limit can be interchanged with the integral by the dominated- 
convergence theorem. Furthermore, the absolute continuity of the integral implies 
F({A}) =0 for any 4 € o(#), so we get 


A2 
Fda) = 55 [” glim [Ra (E-+in) ~ Ru(E—in)] ds. 
Since the resolvent is analytic in p(H), the same is true for R}(-). In general, 
the reduced resolvent is not continuous when we cross o(H) , because otherwise the 
above relations would imply V(0) = 0. It is not excluded, however, that R%(-) has 
an analytic continuation across o(H), 1.e., that there is an analytic function in a 
region 2. C (containing o(H), which coincides with Rj,(-) in the upper halfplane. 
The situation is particularly interesting when this continuation has a meromorphic 
structure, i.e., isolated poles in the lower halfplane. 


9.6.6 Example: Let us return to Example 2. Since dim#H, = 1 the reduced 
resolvent acts as the operator of multiplication by r},(z). Suppose that 


Qa 
Rp — 2 


ry(z) = + f(z) 

for Imz > 0, where a > 0, f is a holomorphic function, and zp := Ap— iby is 
a point in the lower halfplane, 7.e., that r#(-) has the analytic continuation from 
the upper halfplane to € \ {z)}. Using elementary properties of the resolvent we 
find r#(A—in) = r},(A+in) , so the measure F(-) , which is numerical—valued in the 
present case, is given by 


a il i il 
F(A) = aah (<s -<4) dd + — fm f(r) dd. 


This yields further an expression for the reduced propagator, 


a 2i6 : 1 
t) = — pee ae 4 int 
v( ) ori [. (A= Ap)? + 62 € dX + " ‘he e Im fQ) dr. 


The first integral can be evaluated by the residue theorem (Problem 20) giving 
v(t) = ae~rtSltl 4. - f et Im f(A) dd. 
R 


Hence the reduced propagator is expressed as the sum of two terms, the first of 
which corresponds to the pole term and the other to the remaining analytic part of 
the resolvent. We often encounter the situation where the modulus of the second 
term is <1 for all t > 0. Then a = 1 because v is a continuous function with 
v(0) = 1, and the decay law is approximately exponential with [ = 265 . 


The argument described in the example applies in more complicated situations 
too (see the notes), provided the reduced resolvent has a meromorphic continuation 
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to the lower halfplane. The question of circumstances under which such a continu- 
ation exists is nontrivial and the affirmative answer is known only in some cases. 

In the rest of this section we are going to illustrate this on one of the simplest 
examples, which is usually called the Friedrichs model. Let us first describe it. 
The state Hilbert space will be of the form 


H := © © L(R*), 


where the one-dimensional subspace is identified with the state space H., of the 


unstable system; the states are thus described by the pairs ( 7 ib where a is a 


complex number and f € L?(JR+). The Hamiltonian is chosen as Hy := Ho+gV, 
where g is the coupling constant and the free Hamiltonian Hp is defined by 


sa ee Ao 
mT) (ar J 
where Ao is a positive parameter and (Qf)(€) = €f(€) ; hence its continuous 


spectrum covers the positive real axis, o.(Ho) = JR*, and the eigenvalue Ag is 
embedded in it. The interaction Hamiltonian is chosen as 


"(a)=(%P) 


where v is a given function from L*(IR+) ; we can easily check that this opera- 
tor satisfies the Friedrichs condition EqVEq = 0, where Eq is the projection to 
Ha := L?(IR*+). The operator H, is self-adjoint for any g € JR and its domain is 
independent of g (Problem 21a). The above condition makes the Friedrichs model 
explicitly solvable because it allows us to determine the reduced resolvent. 


9.6.7 Proposition: Let Imz # 0; then the operator R¥(z) acts on H, as the 
multiplication by 


r(z) -(- c++ ef vail WF an) 


Proof: We apply the second resolvent formula to the operators H,, Ho ; it yields 


E,Ru,(z)Eu = EuRu(z)Eu 


sm gEWRHy (z)EuV EuRu, (z) Eu ~ gEwRy(z) Eu Baku, (z) Ey ) 


where we have used the commutativity of operators E, and Ry,(z) together with 
the identity FE, + Ea =I, and furthermore, the relation 


EaRu,(z)E. = —9EaRu,(z) Ea EuRu, (2) Eu, 
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where we have also employed the Friedrichs condition. Next we substitute from the 
second relation to the first, and apply to the resulting equation (Ho— z)E, from 
the left. Since (Ho— z)E,Ruy,(z) = Eu, after a simple manipulation we get 

[(Ho- 2) Eu + 9EwV Eu — 9° BuV EaR(2)EaV Eu] EuRu,(2)Eu = Bu. 


Now, using the explicit form of the operators Ho, V , we obtain 


(- Addn gy af = Iw(AyF Fan) r(z)b = o 


for y € H,, t.e., the sought result. 


To be able to use the conclusions of Example 6, we have to know whether 
rg(-) has an analytic continuation. The answer clearly depends on the properties of 
function v. We shall assume that 


(a) there is an entire function f : © — € such that [v(A)|? = f(A) for all 
d € (0,00) ; for the sake of notational simplicity we write f(z) = |v(z)|? for 
nonreal z too. 


Then the needed continuation exists. 


9.6.8 Proposition: Assume (a); then o.(H,) = IR* for any g € IR, and r}(-) has 


an analytic continuation from the upper halfplane, r(z) = [—z + w(z, g)|7', where 
w(d,g) = Ao +g7I(A) — rig? |v(A)|? es) 
oo 2 
agi 2 lv(6)| ee 2 on . 
w(z,g) o+g [ ey; d& — 2mrig*|v(z)| Imz <0 


and I(X) is defined as the principal value of the integral, 
() me) OP 
=p a “ tha 
i o pa 0+ [ j (ime a € i 


Proof: As a finite-rank operator, V is compact so dess(Hg) = Cess(Ho) = 
by Theorem 5.4.6. Furthermore, H, has only simple eigenvalues of which at most 
one lies in JIR~ while the eigenvalues in (0,00) can be only at the points where 
v(A) = 0 (Problem 21). Since |v(-)|? is analytic by assumption, the eigenvalues 
have no accumulation point, so ¢-(Hg) = dess(Hg) = IR*. 

To prove the second assertion we have to check that the function w(-,g) defined 
as above for Imz <0, and by 


ulna) = do + of” POF ae 
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for Imz > 0, is analytic in ('\ IR~ . Its analyticity in the upper and lower complex 
halfplanes verifies easily, and in view of assumption (a), w(-,g) is continuous when 
crossing (0,00) (Problem 22), and therefore uniformly continuous in any compact 
McC\f. Its analyticity then follows from the edge-of-the-wedge theorem (see 
the notes). 


These properties of the reduced resolvent make it possible to prove the mero- 
morphic structure of its analytic continuation. 


9.6.9 Theorem: Suppose that assumption (a) is valid and v(A9) # 0. Then there 
is an € > 0 such that for all nonzero g € (—e,¢) the function r of Proposition 8 
has just one simple pole at the point 2z)(g) := Ap(g) — i6p(g). The function zp(-) is 
infinitely differentiable and the expansion 


Ap(g) = Ao +97L(Ao) + O(9*), S(9) = 1g? |v(Ao)|? + O(g*) . (9.13) 


is valid in the vicinity of the point g = 0. 

Proof: We define F: (x (C\ R-) — € by F(g,z) := z—w(z,g) ; possible poles of 
r(-) obviously coincide with the zeros of F’. In view of Proposition 8, the function 
F is infinitely differentiable in both variables. It also satisfies F'(0,A9) = 0 and 
(OF /0z)(0, Ao) = 1; thus we may use the implicit—function theorem, which implies 
the existence of a complex neighborhood U, of the point g = 0 and an analytic 
function z»: U; + @ such that F(g,zp(g)) =0 for g € U,. Since F' has contin- 
uous partial derivatives, (OF /0z)(-, Zp(-)) is also continuous, and therefore nonzero 
in some neighborhood U2 Cc U, of g = 0. This means, in particular, that r(-) 
has at the point z,(g) a simple pole provided g € U.N R. Finally, the Taylor 
expansion (9.13) is obtained by computing the derivatives of the implicit function 
Z(-) at g=0 (Problem 23). & 


Hence the Friedrichs model fits into the scheme described in Example 6 provided 
the coupling is weak enough. Under additional assumptions we can prove that the 
pole part of reduced resolvent r(-) dominates over the analytic remainder (see the 
notes); in that case the decay law of the initial state , := ( a ) is approximately 


0 
exponential and 
T'(g) = 6p(g) = 2xg"|v(Ao)? + O(9*) 
is the corresponding decay rate. 


9.6.10 Remark: It is illustrative to compare this expression to the commonly used 
way of computing the decay rate by the so-called Fermi golden rule, 


> 


d 
Pr(g) = 2ng? 5 (Vu, Ey” Pe(Ho)V vw) 
A=Ao 


where {Eo } is the spectral decomposition for Hp and P.(Ho) the projection 
to the continuous subspace of this operator. To realize that ['r(g) is indeed the 
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formula known from quantum-mechanical textbooks, recall that we assume h = 1 
and formally 2 BN P(E) = |\)(A|. Using the explicit form of the operators Ho 
and V, we find 


= 21g? |v(ro) |’, 
A=Ao 


Tr(a) = 2g? [wie Pde 


i.e., 'p(g) is nothing else than the first nonzero term in the Taylor expansion of 
the function ['(-). 


The Friedrichs—-model example illustrates a characteristic feature of many decay 
processes: the free Hamiltonian Ho has an embedded eigenvalue, which disappears 
when the interaction is switched on, but leaves a footmark in the form of a pole in 
the analytically continued resolvent. Another problem of this type will be discussed 
in Section 15.4, and further references are given in the notes. 


Notes to Chapter 9 


Section 9.1 The detection of cosmic muons is discussed in the framework of the classical 
theory of relativity in [ Vot ], Sec.IV.4; for the quantum description of moving-particle 
decay see [BN 1], [Ex 3]; [ Ex], Sec.3.5. The Wigner theorem that we have employed to 
motivate the postulate (Q4a) is proved, e.g., in [FG], Sec.1.3; [BaR], Sec.13.2. Theorem 2 
is particularly useful when we investigate the classical limit of quantum dynamics, as 
Corollary 4 illustrates. However, the Ehrenfest relation itself does not ensure existence of 
the correct classical limit because in general (F}(Q)) # Fj((Q1),--.,(Qn)) ; more on this 
subject can be found in [Hep 1]; [Thi 3], Sec.3.3. 


Section 9.2 The unitary—valued function V : V(t,s) = Uo(s—t)U(t,s) that appears in 
the definition of the interaction picture is not a unitary propagator; however, we can use 
it to define a unitary propagator relating the vectors wp at different times (Problem 6). 
The nonexistence of the interaction picture in relativistic quantum field theories does not 
mean that the perturbative calculations used there are generally incorrect. Most of them 
can be justified; however, a presentation of the problem exceeds the scope of this book — 
we refer, e.g., to [GJ]. 


Section 9.3 Theorem 1 claims that time evolution preserves smoothness of the initial 
condition. In fact a stronger result is valid, namely that possible singularities of the 
function U(t,s)y are weaker than those of w if t > s, and the differentiability properties 
are improved by the propagator. This is true not only in the free case but also for a 
wide class of Hamiltonians with potential and magnetic-field interactions including time- 
dependent ones — see, e.g., [Ya 1]. 

The treatment of the harmonic oscillator easily generalizes to the n-dimensional sit- 
uation. In addition to the straightforward proof proposed as Problem 9, the propagator 
kernel of Theorem 6 can be derived formally by evaluating the corresponding Feynman 
integral — see [FH], Sec.3.6; [Schu], Chap.6; with an appropriate definition of the latter, 
such an argument can be made rigorous. It is also possible to avoid the use of the Feyn- 
man integral if the kernel is expressed by means of the Trotter formula — ef. [CRRS 1], 
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and also [ Ex], Sec.6.2. Other proofs of Theorem 6 are given in [Ito 2], [Tru 1]. Ina 
similar way, the Feynman integral can be employed to derive explicit expressions of the 
propagator or resolvent kernel for many other simple systems — see, e.g., [Cra 1], [CH 1], 
[DLSS 1], {CCH 1], [BCGH 1], [KL 1], [Gro 1}; the two last named reviews, as well as 
[Schu], contain many other references. 


The last term in the exponential factor of the harmonic-oscillator kernel, which causes 
a jump in phase every halfperiod, is called the Maslov correction. It appears for other 
potentials too, typically at the instants when the classical particle passes a turning point 
— see [ETr 1]. The behavior of minimum-uncertainty states discussed in Example 5 was 
first pointed out by E. Schrédinger in [Schr 1]; the question whether a similar situation 
can occur for other potentials is discussed in [NSG 1]. 


Section 9.4 The expression of a propagator in form of the “path sum” was proposed by 
R. Feynman in 1942; he in turn was inspired by the earlier work of Dirac. He published 
his results in a series of papers starting from [Fey 1]; they are summarized in [FH]. A 
mathematician’s view of the Feynman papers can be found in [Jo 2]. Among other books 
dealing with physical applications of Feynman integrals we may mention, e.g., | LRSS], 
[Pop], [Schu], and [SF] devoted to the quantum theory of gauge fields. 

We mentioned in Section 9.3 that the classical limit of quantum mechanics can be 
interpreted as the limit A — 0. Had the right side of (9.6) been expressed as a standard 
integral, we might perform this limit by the stationary-phase method. A generalization of 
this procedure to one of the possible definitions of the Feynman integral can be found in 
{ AH 1]; among other papers studying the classical limit and semiclassical approximation 
with the help of Feynman integrals let us mention, for instance, [DeW 1], [Tru 2]. 

The Wiener measure theory is explained, e.g., in the monographs [ Kuo], [Si 4]. A 
peculiar property of this measure is that smooth paths form only a small subset in the 
path space T,. A function 7 ET, is said to be Hélder continuous of order a if there is 
a C such that |y(r) — 7(s)| < C|r — s|* holds for all r,s € [0,t]. Denote by Ha the set 
of all y€ I with this property; then we(Ha) =1 for 0<a< 4 while wo(Ha) =0 for 
3 <a<1. This means that a “typical” path is rather irregular; in particular, the set of 
all smooth paths has the Wiener measure zero. 

A formula of type (9.8) was obtained for the first time in [Kac 1]; its proof under the 
stated assumptions can be found in | RS 2], Sec.X.11. A few independent proofs which 
require stronger hypotheses about the potential but no restriction to dimension n is given 
in [Si 4], see also [GJ], Chap.3; a mathematical generalization can be found in [Lap 1}. 
The Feynman-Kac formula is a powerful tool for analyzing Schrédinger operators — see, 
e.g., [Si 4] and [Si 3]; a generalization to systems with relativistic kinetic energy is 
given in [CMS 1]. The Feynman-Kac formula can also be derived for systems with an 
infinite number of degrees of freedom. This result plays an important role in the Euclidean 
approach to constructing models of interacting quantum fields — see, e.g., [Si 2], Chap.V; 
[GJ], Chaps.19, 20. 

Theorem 3, which claims that Feynman integrals are not integrals in the usual sense, 
is taken from [Cam 1]. The proof of nonexistence of the measure wz for Imo # 0 is based 
on showing, with the help of a suitably chosen sequence of cylindrical functions, that such 
a measure should have an infinite variation — see also [Tru 3]; [Ex], Sec.5.1. This is due 
to the oscillatory behavior of the exponential term that formally determines the measure 
We. On a heuristic level, the following conclusion can be formulated: if we want to give a 
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reasonable meaning to expressions of the type fr. f(7) e*50(7) Dy , we have to restrict our 
attention to sufficiently smooth functions f to suppress the influence of oscillations. It is 
worth mentioning that the analogous functional integral for relativistic systems described 
by a Dirac operator has been shown in [Ich 1] to exist for two-dimensional space-time, 
while in the realistic case of dimension four the corresponding path measure again does 
not make sense — cf. [Zas 1]. 

There are many ways of defining Feynman integrals rigorously. We will briefly mention 
the most important of them; a detailed exposition with further references can be found in 
[ ACH] and [Ex], where relations between various approaches are also discussed. The 
procedure used here to express the propagator by Trotter’s formula can be adopted as a 
definition of the Feynman integral; this idea first appeared in [Nel 2]. However, it is closer 
to the spirit of Feynman’s heuristic approach to use expressions of type (9.5), which contain 
the eract action along a given polygonal path. Such definitions have been formulated and 
their equivalence with a Trotter-type definition has been proved for various classes of 
potentials, e.g., in [Cam 1-3], [Tru 1-4]. In [Ito 1,2], the expression fy_ f(7) ee Dy 
is defined as a limit of a net of integrals with respect to suitably chosen Gaussian measures 
on the path space. 

There are also other definitions which do not require limits; they employ, e.g., objects 
generalizing the notion of measure [DMN 1]; another approach works with generalizations 
of Fresnel integrals — see [ AH], Chaps. 2,4. Finally, there are prescriptions using the 
Wiener measure theory, in which the sought object is defined by analytic continuation — 
see [Cam 2], [Nel 2], [JS 1], ete. — in mass or in time. It is shown in [Jo 1] that the 
“Fresnelian” definition can be extended in this way. One of the main aims of the quoted 
papers was to give meaning to the relation (9.6) , and therefore most of them contain 
expressions of the propagator using the respective definition of the F-integral; pioneering 
work was done in [Cam 1], [Ito 1], [Nel 2]. Formulas of type (9.6) are also valid for 
complex potentials — see [| Ex], Chap.6. 


Section 9.5 The methods of solving the time-dependent Schrédinger equation discussed 
here are inspired by ways of solving the Cauchy problem for ordinary differential equations. 
In the case of Theorem 1 we use the fact that the original equation is formally equivalent 
to the Volterra equation 


te = 6-4 f H(edr 


we have to check that the iteration series makes sense. The expansion is named after 
F. Dyson, who used it in his pioneering work on quantum electrodynamics (1949). Theo- 
rem 3 represents a particular case of a general existence result for evolution equations in 
Banach spaces, which can be found in [RS 2], Sec.X.12. The idea is again inspired by the 
theory of ordinary differential equations; it was first applied to vector-valued functions in 
[Ka 1]; see also [ Yo], Sec.XIV.4. Similar results can be proved under weaker assumptions 
on the time-dependent part, so that, e.g., Schrédinger operators with moving Coulomb 
potentials are included — see [Ya 1], and [KY 1] for the Dirac operator. 

If H(t) := Ho + V(t), where Ho is the free Hamiltonian (9.2) and V(t) is a time- 
dependent potential, the relation (9.10) can be rewritten in a form analogous to (9.4), 
i.e., as a Feynman integral where the action now depends parametrically on time — see, 
e.g., [Far 2], [EK 1}; [Ex], Sec.6.1. This fact has indeed been used to compute explicit 
expression for propagators of some simple systems — examples can be found, for instance, 
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in [KL 1], Sec.9; [Du 1]. 

The result mentioned at the end of Example 5 is related to the adiabatic theorem, 
which can be briefly formulated as follows. An isolated eigenvalue A(t’) of the Hamiltonian 
H(t’) is called regular if the corresponding projection E(t’) is finite-dimensional and the 
functions E(-) and (H(-)—X(-))~!(I—E(-)) are strongly continuously differentiable. The 
theorem then says that the probability of transition from a regular state to any other (i.e., 
orthogonal) state in a fixed time interval {s,t] mediated by the propagator 


U;(t,s) :-= T exp (- ‘a H (£) at’) 


decays as O(r~!) for t + oo. The proof is again based on Dyson’s expansion — see, 
e.g., [| Thi 3], Sec.3.3. This is by no means the strongest result which can proved; under 
not very restrictive assumptions on the spectrum of Ho and the regularity of the time— 
dependent part V(t), the transition probability can be shown to decay as a higher power 
of the adiabatic parameter 7 or even exponentially — see, e.g., [| Nen 1,2], [JoyP 1,2]. 
As in the case of the classical limit, we can also ask about expansions of the considered 
quantities in terms of the parameter 7 ; then the term adiabatic approximation is usually 
employed. This concerns not only the transition probabilities but other properties of the 
system as well; for example, the behavior of the eigenvalues and eigenfunctions of Ho 
under the influence of the perturbation V(t) — cf. [Hag 2]; another application can be 
found in [ ASY 1]. 

The adiabatic approximation has the following important feature: if the Hamiltonian 
H(t) depends on time through a time-dependent parameter which makes a loop in the 
parameter space, i.e., it eventually returns to its original value slowly enough, then, during 
this process, the state acquires a phase factor, the so-called Berry phase, which can be 
observed in an interference experiment, when it is compared to the state which has evolved 
without this perturbation — cf. [Be 1]. Interesting effects occur when adiabatic and 
semiclassical approximations are combined — see [ Be 2]. 

Propagators of time-dependent systems are often studied using the so-called Howland 
method, which replaces the original problem by investigation of the operator —i0/0t + 
H(t) on the enlarged Hilbert space L?(IR;H) = L?UIR) @H — see [How 5] and also 
[RS 2], Sec.X.12. The dynamics of time-periodic systems is discussed, for instance, in 
[Ya 2], [YK 1], [EV 1]; Examples 6 and 7 are taken from the last named paper. In the 
nonresonant case the spectrum of U(T,0) is known even to be pure point and dense on 
the unit circle — cf. [HLS 1]. Spectra of monodromy operators for more general systems 
and their stability with respect to perturbations has been an object of interest recently — 
see, e.g., [How 6] and [How 7] for the example of a driven anharmonic oscillator, and 
also the papers on quantum chaos mentioned in the notes to Sec.7.1. 


Section 9.6 There are many mathematical problems related to the time evolution of 
unstable quantum systems. We shall briefly describe some of them; a detailed discussion 
with more references can be found in | Ex], Chaps.1-4. The investigation of general 
features of unstable systems based on assumption (u) is sometimes called the quantum 
kinematics of decay processes, because it concerns properties which are independent of 
the particular form of the total Hamiltonian — see [HM 1], [HLM 1], [ Wil 2]. 

One of the problems formulated in these papers is the inverse decay problem: given 
a continuous operator—valued function V : IR + B(H,) with ||V(t)|| < 1 for all ¢, 
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we look for a Hilbert space H D H, and a unitary propagator U(t) = ent! omer 
such that the corresponding reduced propagator coincides with V. A solution exists 
provided V is of the positive type (see the notes to Sec.5.9). This condition is satisfied, in 
particular, if { V(t) : t > 0} is an operator semigroup; more information on this subject 
can be found in [Da 1], Chap.7; [Ex], Sec.1.4. Stone’s theorem generalizes to operator 
semigroups — cf. [RS 2], Sec.X.8; [Da 2] — allowing us to characterize a semigroup 
{ V(t): t >0} by means of its generator. In this way we can give a rigorous meaning to 
phenomenological non-selfadjoint Hamiltonians which appear frequently, e.g., in nuclear 
physics — see [BEH 1]; [Ex], Chap.4. 

Assumption (u) does not exclude the situation when H{ is invariant with respect to 
U(t), or alternatively 71, is invariant for negative times — see Problem 17a. The decay 
law for a mixed state W is defined by Py(t) := Tr(V(t)*V(t)W) ; its interpretation and 
properties are analogous to the special case W = Ey. The proof of Theorem 5 can be 
found in [{ Wil 2]; moreover, the hypothesis can be weakened: o(H) = JR holds provided 
V(s)V(t) = V(s+t) for all s > 0 and t > T, where T is any positive number — see 
[Sin 1]. The semigroup condition means that the decayed state cannot regenerate; the 
violation of this property for short times is discussed in [MS 1], [Ex 5]. The approximation 
of a decaying state by bounded-energy states is investigated in [Ex 1]. The parameter ¢ 
in Example 4 characterizes the weak CP-violation in neutral-kaon decay. 

We mentioned in Example 6 that the support of the operator—valued measure F‘(-) 
is contained in o(H). It is demonstrated in [Ex 4] that under the minimality condition 
H = (Uier U(t)Hu))i, the opposite inclusion is also valid, so suppF = o(H). The 
argument of Example 6 extends to dim H, > 1 ; the approximation obtained by replacing 
the reduced resolvent by the pole term of the corresponding Laurent expansion is called the 
pole approximation. Justification of this approximation, i.e., estimating the contribution 
of the neglected terms, represents a difficult problem for particular systems. For the 
Friedrichs model it was done in [Dem 1]; a related Galilean-invariant model of two- 
particle decay was analyzed in [DE 2]. 

The Friedrichs model was first formulated in [ Fri 1]. It attracted interest when a 
similar Hamiltonian was used in the so-called Lee model of quantum field theory [Lee 1]; 
the decay problem in the Lee model has been discussed by [AMKG 1}, [H6 1], and many 
other authors. Recently the Friedrichs model with a time-periodic coupling constant has 
been proposed — see [HL 1]. 

To prove Proposition 8 we need the edge-of-the-wedge theorem which claims the fol- 
lowing: suppose the functions F;, 7 = +,—, are analytic in the regions 2; of the upper 
and lower complex halfplanes, respectively, and their common boundary contains an open 
interval J C JR. Assume further that Fj(€) := limpo4 F)(€+ijn) exist uniformly in J, 
are continuous with respect to €, and Fy(€) = F_(€) for all € € J. Then there is an 
analytic function F: (24U9_UJ) + @ such that F(z) = Fj(z) for z €;. For proof 
see, €.g., [SW], Sec.2.5. The implicit-function theorem can be found, for example, in 
{Schw 2], Thms.1II.28 and III.31. 

In quantum—mechanical textbooks the Fermi golden rule is usually derived formally, 
using continuous-spectrum “eigenfunctions”; a critical discussion of this procedure is given 
in [RS 4], notes to Sec.XIII.6. As in the case of the Friedrichs model one can prove that 
the Fermi rule yields the leading-order contribution to the imaginary part of the pole 
position for many other models — see, e.g., [Bau 1], {How 3], {Si 4]. The second of these 
papers presents an example showing that a formal use of the Fermi rule may lead to false 
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conclusions. 

The embedded-eigenvalue perturbation theory which applies to most decay models 
was first studied in [Fri 1]. If the interaction does not fulfil a Friedrichs—type condition, 
then the algebraic way of obtaining the reduced resolvent used in the proof of Proposition 7 
fails. In that case the factorization technique, the idea of which goes back to [Ka 2], is 
often used: one assumes V = B*A for some operators A,B, which allows us to express 
the resolvent by means of the formula of Problem 1.63. Reduced resolvents for different 
types of perturbations have been studied in this way — see, e.g., [How 1-4], [Bau 1], 
[BD 1], [BDW 1]; the difficult part is here usually to check that the results are independent 
of the chosen factorization. 

Another method for the investigation of embedded-eigenvalue perturbations, which 
is particularly suitable for Schrédinger operators, uses so-called complex scaling, i.e., an 
analytic continuation of unitary scaling transformations (corresponding to linear maps y 
in Example 3.3.2) to complex values of the scaling parameter. Adopting certain hypotheses 
about the analyticity properties of the potentials, one can in this way turn the search for 
the continued-resolvent poles into the true eigenvalue problem of some non-selfadjoint 
operator — cf, for example, [ AC 1], [Si 4]. 

A typical example discussed in [Si 4], and also in [ RS 4], Sec.XIL.6, is the helium 
autoionization effect. Its essence is as follows: if we switch off the Coulomb repulsion 
between the electrons in a helium atom, the Hamiltonian will have a continuous spectrum 
with embedded eigenvalues. The repulsion represents a perturbation which makes these 
eigenvalues “dissolve”, turning them into resolvent poles. However, the atom remembers 
the disappeared eigenvalues; they are manifested, for instance, by the cross-section peaks 
that appear near these energy values if electrons are scattered on Het ions — cf. the 
notes to Sec.14.4. 

In addition to scaling, other families of substitution transformations have been contin- 
ued to the complex region to enable treatment of other classes of potentials — cf. [BB 1], 
[Cy 1]; these include, in particular, translations — see, e.g., [HH 1], [AF 1]. An exten- 
sion of the complex scaling to Dirac operators can be found in [Seb 2]. All these results 
need some analyticity hypotheses. Without them it no longer makes sense to identify the 
unstable states with poles of the continued resolvent; one can only prove (for a particular 
class of Schrédinger operators) that the embedded eigenvalues disappear in the continuous 
spectrum under the influence of the perturbation — cf. [AHS 1]. 

There are decay processes that are described by perturbation of isolated eigenvalues, 
which disappear in the continuous spectrum once the perturbation is turned on. This 
concerns, for instance, decays by tunneling through a potential barrier; a classical example 
is represented by the Gamow theory of a-radioactivity [Gam 1]. Unstable states of this 
type are usually called shape resonances; in a mathematical description of such decays 
one can take for the unperturbed system one with the barrier extended to an infinite 
width [ How 2]; at the same time the embedded-eigenvalue perturbation theory can be 
used when the height of the barrier is being blown up — see, e.g., [ AsH 1], [ AS 1], 
[ES 1]. The semiclassical approximation for shape resonances has been studied, for 
instance, in {CDS 1], {CDKS 1], and also in | Nak 1], where a complex distortion is used. 
A perturbation theory for unstable states coming from isolated—eigenvalue perturbations 
has been discussed in [Hun 1]. 

Another example of isolated-eigenvalue decay is the Stark effect, which represents a 
linear perturbation (caused typically by an electric field) to Schrédinger operators with a 
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given potential, in particular, a Coulomb-type interaction — see, for instance, [RS 4], 
Sec.XII.5, and for later results [Her 1], [GG 1], [Nen 1], [Wa 1]. A related and rather 
difficult problem is the so-called Wannier ladder, i.e., a linear perturbation to a periodic 
potential — see [Avr 1], [HH 1], [AF 1], [CoH 1], [BuD 1], [AEL 1], etc. 


Problems 


1. Let {U(t,s) : s,t € IR} bea family of operators each of which is either unitary 
or antiunitary. If conditions (i),(ii) of the definition of the propagator are fulfilled, 
then all U(t,s) are unitary. 


2. Prove: (a) A conservative system whose Hamiltonian has a purely continuous 
spectrum has no stationary states. 


(b) In a system of free particles, functions of the momentum and spin components 
are integrals of motion. 


(c) Integrals of motion need not commute mutually. 
(d) Under which condition is any integral of motion a function of the Hamiltonian? 
3. Let Hs be the subspaces in L?(JR) consisting of the even and odd functions, res- 


pectively. Prove that each of the operators P?}H4 has a simple spectrum. 
Hint: Apply the results of Problem 5.43a to the basis vectors ho, hi , respectively. 


Let Ay(t) be an observable in the Heisenberg picture; then f(Ay(t)) = (f(A)) x(t) 
holds for any Borel function f and all te R. 


= 


5. Let H := Ho+ V with Ho self-adjoint and V Hermitean. Prove (9.3) and the 
corresponding equation for mixed states. 


6. Let U;, U2 be unitary propagators. If U3 : U3(t,s) = U,(t,s)U2(s,t) is a uni- 
tary propagator, then U;(t,s) and U2(r,s) commute for any r,s,t € IR. On the 
other hand, U; : U;(t,s) = U3(t, 7)U3(s,7)—! = U(r, t)Uo(t, s)U;(s,7) is a unitary 
propagator for any fixed re R. 


7. Given a system of free particles, 


(a) verify Example 9.3.2., 


(b) prove that the results of Proposition 9.3.3 remain valid as long as the mean 
values make sense. 


8. Consider again the free-particle system of Section 9.3 and an open bounded set 
M Cc IR” in the configuration space. For any t > 0 and x ¢ M, there is a state w 
such that suppy C M and |dy(x)|? >0 for vy := U(t)p. 


9. Prove Theorem 9.3.6. 
Hint: Check that that U(t) maps S(JR) onto itself and its kernel satisfies the 


equation (i 2 + = = - jmw?2?) Ki(x,y) =0 for any re R. 


10. Verify the calculations in Example 9.3.8. 
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HT, 


Fill in the details in the proof of Theorem 9.4.1. 


Let potential V be differentiable with OV/8z,; bounded; then there is a positive 
Cy such that Sy used in (9.4) differs from the action over the polygonal path 
y= yy, ...,y%5-) by [Sn(y,..., 9%); 2) — $(7)1 < CvilrllooN-*. 

Hint: Use the Taylor expansion at the vertices with the Lagrange form of the re- 
mainder. 

Prove (9.5) for a linear potential, V(x) := (7_1 a;2; +8. 

Hint: Use the relation Sy(y,...,y%);t) = S(y(g,..., 9); t)4+O(N-!) , where 
of” = y+ ajkt?/2m;N?. 

Let I’ be a real Hilbert space. A nontrivial measure yw on I’ such that 


(i) w(M) < oo for any bounded Borel set M CTI, 
(ii) yw is translation—invariant, 


exists iff dimI < oo. 


Hint: The unit ball in an infinite-dimensional [ contains infinitely many disjoint 
balls of radius 1/4. 


Fill in the details in the proof of Theorem 9.5.1. 


Prove: (a) The trajectories of the classical oscillator driven by a time-dependent 
force f(t) are given by 


Dees 1 
a(t) = xcoswt + mw on wt — mu Ph 8) 
p(t) = —mwez sin wt + pcos wt + a(t,s), 
where a, @ are the functions defined in Example 9.5.6, provided the initial 
conditions are x(s) =x and p(s) =p. 
(b) In the same way as in Problem 9, check the expression of the propagator (9.11). 
(c) Compute the functions a and @ for f(t) :=sin Nt. 


(d) Show that if 8w/Q is noninteger, one can choose q,p in such a way that 
the corresponding state (8.2) is an eigenvector of the monodromy operator 
corresponding to (9.11). 


Hint: The action of U(T,0) on tq,» can be written down explicitly using Exam- 
ples 9.3.5 and 5.5.1b. 


Using the notation introduced in Section 9.6, 


(a) find an example of Hilbert spaces Hy C H and a unitary propagator U on H 
such that assumption (u) is valid and Hd is U(t)-invariant for all t > 0. 


(b) prove Proposition 9.6.1. 


Hint: (b) The group property would mean that V(t) is a partial isometry. 
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If w is a finite-energy state, the initial decay rate satisfies Py (0+) =. 
Hint: Since |(w,U(t)~)|* < Py(t) < 1 it is enough to differentiate the lower bound. 


Find the decay laws 


(a) for the truncated Breit-Wigner state y,, where 


1 -1 
dbo, BY be) = No x1-09(A) ((A=2o)?+ 31) dd, 


and N, is the normalization factor, 
(b) for the neutral kaon states of Example 9.6.4. 


Compute the function v(t) of Example 9.6.6. 


Let H, be the Friedrichs-model Hamiltonian. Prove 

(a) Hg is self-adjoint on D(H,) := { ( : ) :aeEC fe D(Q) } for any g € R, 

(b) the point spectrum of Hy is simple, and a positive A can be an eigenvalue 
only if v(A) = 0, 

(c) Hy has at most one eigenvalue in the interval (—oo,0], and this happens if 
g? > ro (fe €-"u(€)|2 dé)". If v(0) #0, then a negative eigenvalue exists 
for any nonzero g. 


Hint: (c) The function A+ f>°(A—€)7~!|v(€)|?aé is monotonic in (—oo,0). 


Fill in the details in the proof of Proposition 9.6.8: check that the function J(-) is 
continuous in (0,00) and limfem—ro4) W(E +i, 9) = g?1(A) F mig?|v(A)|? is valid 
for any A>0. 

Hint: The function (|v(-)|?— |v(A)|?)(A — -)7!_ is bounded in the vicinity of € = 2. 


Verify the Taylor expansion of Theorem 9.6.9. 


Chapter 10 


Symmetries of quantum systems 


The problem we are going to discuss now, namely various symmetry transformations, 
is studied and used in almost all parts of physics; in some of them, e.g., in the 
general theory of relativity, it is built into the very foundations of the theory. The 
importance of symmetry properties for quantum physics stems basically from two 
facts. First of all, they are related to the different ways in which a system is observed. 
We already know that quantum systems are generally affected by measurements, and 
therefore the state after a measurement also depends on the particular observer who 
performed it. What is even more important, the object on which the transformations 
are studied is the state Hilbert space, and as a consequence, quantum systems may 
exhibit additional symmetries which are not related to the underlying space-time 
structure. 


10.1 Basic notions 


Any quantum system can, at least in principle, be investigated by different experi- 
mentalists, and each of them eventually will convert results of the observation (with 
some assistance of a theoretician, maybe) to a description of the system along the 
lines discussed in the preceding chapters: a Hilbert space will be chosen to describe 
pure states, certain operators will be associated with observables, etc. A natural 
consistency requirement is that all this must allow a translation to the language of 
another individual or team who has observed the same object in such a way that the 
results are in mutual agreement. In particular, the probability of transition between 
any pair of states must be the same for the two observers. 

There are numerous ways in which the two observations may differ, and indeed 
some of them are related rather to the language, 7.e., the system of notions used by 
the theory; recall different experimental setups used to measure the same observable 
which we mentioned in Section 7.1. Apart from that, there are others which form 
the matter of our interest here. 


10.1.1 Examples: (a) The other apparatus has a different location, it is shifted, 
rotated, or both. It can also be operated at some other time than the first 
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apparatus was. This amounts to using another coordinate system or a clock 
adjusted to a different initial instant. 


(b) We also can employ different coordinate-axes numbering from that of another 
observer. If such a renumbering is not cyclic it cannot be achieved by rotating 
the second apparatus with respect to the first one; instead it corresponds to 
regarding the object of investigation through a mirror. 


(c) Different labels can be used for particle charges, baryon numbers, and other 
“internal” properties. 


(d) We can record a process and later scan its time evolution deliberately choosing 
the direction in which the tape is run through the projection machine. 


These and similar relations between observers have to correspond to some relations 
between results of their measurements. For definiteness, consider a pair of observers 
O and O’, both performing experiments on the same system S. If they both rely 
on the standard scheme of quantum theory, they will describe (pure) states of S 
by rays in a Hilbert space. Strictly speaking, each of them could choose his or 
her own Hilbert space for this purpose but this would cause no complications. We 
know from Section 7.1 that the dimension of the state space is in fact the number 
of independent states in which the system can exist; hence a choice of spaces of a 
different dimension would eventually lead to a discrepancy. Since all Hilbert spaces 
of the same dimension are mutually isomorphic, we may assume that the state space 
is the same for the two observers and call it as usual H. 

The ways in which the observers associate rays or statistical operators with states 
of the system are clearly determined by the sets of observables they use. The simplest 
possibility is to assume that both observers are endowed with same experimental 
equipment, i.e., that they use the same operator to describe a particular observable 
(however, there are alternatives — see the notes). 

If the system is in a given pure state, the first observer will describe it by a 
ray W° , while the second will associate with the same state a ray V® . The above 
mentioned translation means that there is a bijective map Too : Fo — Fo: between 
the sets of rays they ascribe to admissible states in their formalisms. Now we can 
repeat the argument used in the previous chapter for time evolution. The transition 
probabilities determined by the two observers have to be the same, 


P(O?,W%) = P(6°, VW), 


for any 6°,U° € Fo and 6°, W% € Fo. Then it follows from Wigner’s theorem 
that the map Toro between the rays can be realized by means of a unitary or 
antiunitary operator on H ; with the usual abuse of notation we shall employ the 
same symbol Too for it. 

The observers can naturally ask about the validity of the superposition principle. 
The correspondence between their results means, in particular, that the superselec- 
tion rules determined by them are the same; if the first observer will rule out the 
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existence of a nontrivial superposition of a proton and neutron, the other must ar- 
rive at the same conclusion whatever names he or she uses for these particles. If 
the system is not coherent, the operators Toyo are not unique in general; however, 
this nonuniqueness is removed easily provided we make another natural assump- 
tion, namely that Toro is reduced by all coherent subspaces of the system. This 
discussion may be summarized in the form of another postulate: 


(Q5a) the vectors which observers O and O’ use to describe the same state of the 
system are related by a unique unitary or antiunitary operator Too , which 
is reduced by all coherent subspaces. 


Let us remark that the similarity between these considerations and those of Sec- 
tion 9.1 is not accidental. In view of Example la, time evolution is nothing else 
than a particular case of the transformations discussed here. 


10.1.2 Remarks (on the active approach): (a) The way in which we have treated 
the problem up to now is sometimes called the passive approach to the prob- 
lem of symmetry transformations. Another possibility is to consider a single 
observer who performs experiments on a pair of systems S, S’, the second of 
which being obtained by some transformation performed on an identical copy 
of S and vice versa. With this fact in mind, the observer will ascribe to them 
the same Hilbert space 1, but a particular ray will correspond in general to 
different states for the two systems. Conversely, if a state of S is described by 
aray Ws, another ray Vs corresponds to the same state of the system S’. 
A straightforward modification of the above argument allows us to associate 
with the given pair S, S’ a unique operator T*’S which is unitary or antiuni- 
tary, reduced by all coherent subspaces, and maps each Vg on the respective 
Ws. Moreover, if S’ is obtained from S by the same transformation that 
has related the observers O and O’ in the passive approach, then 


Vile a (10.1) 


follows from the consistency requirement (Problem 1); this makes possible an 
easy translation between the results obtained in the two approaches. 


(b) It is not difficult to imagine a transformed system obtained by changing its 
position in space or time. On the other hand, in some of the situations men- 
tioned in Examples 1b-d the active approach cannot be applied because the 
transformed system does not exist —- see Example 10.2.5 below. An attentive 
reader may object that the same applies to the observers: after all we almost 
surely have our heart on the left side, to say nothing about the unique ori- 
entation of biological time and the fact that we are composed of matter and 
not antimatter. A partial explanation why the passive approach can never- 
theless be used in such situations has already been indicated in Examples 1, 
namely that the transformed observation can be realized by technical means 
like mirrors, recording devices, etc. There is a deeper reason, however. What 
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we here briefly call an observation is in fact a complicated cognitive process, in 
which we are allowed to use any possible mental image of the physical object 
or process under investigation; the sole criterion for the success of the resulting 
theory is that its predictions are correct and free from contradictions. 


(c) Both approaches are nothing but theoretical idealizations. If the same experi- 
ment is performed, say, in CERN and Fermilab, then two teams equipped with 
analogous devices each measure on their own proton beam; hence we have, 
loosely speaking, two observers and two systems. In view of relation (10.1), 
however, it is no problem to compare the results in such a more realistic setup. 


Up to now we have considered pure states only. A generalization to the mixed- 
state case is straightforward. If the first observer concludes that the state of the 
system is a mixture of pure states wi) with some weights w;, 7 = 1,2,..., then 


the second has to describe it by the mixture of i with the same weights. Since 
the corresponding projections Ey,, Ey, are related by Ey = Toro Ey;T hos 2S 
reasonable to extend the postulate (Q5a) as follows: 


(Qda) the statistical operators W°?) and W'°) which observers O and O’, re- 
spectively, associate with the same state of the system are related by W() = 
TorioW T55 , where Too is a unique unitary or antiunitary operator which 
is reduced by all coherent subspaces. 


Notice that since the relation contains the transformation operator together with its 
inverse, it maps linear statistical operators into linear operators again, even if Toro 
is antiunitary. 

In some situations only a pair of observers may be involved, for instance, when 
the transformation in question describes a mirror image or time reversal. However, 
more often larger families of transformations have to be considered. Then the opera- 
tors relating different descriptions have to satisfy a natural consistency condition, 
because the maps 0?) 1 WO) + Wl and WO) 1% vw”) must yield the same 
state for any ray ¥‘°) corresponding to a realizable state, i.e., the corresponding 
operators may differ at most by a phase factor, 


Toro = wTororTovo , 


where |w| = 1. On the other hand, the number w need not be the same for different 
triplets {O,O’,O”}. 

If we want to proceed further with the analysis of the operators Toro , we have 
to replace the rather vague notion of a transformation between the observers which 
we have used up to now by something more specific. It is useful to divide the 
family of all possible transformations into smaller classes within which different 
transformations can be compared. We shall assume that a particular class forms a 
group, or equivalently, that there is a group G to which the class can be bijectively 
mapped, each transformation O ++ O' being associated with an element gEeG. 
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10.1.3 Remark: Since obviously any family of transformations is equipped with 
the identical and the inverse elements, the only problem concerns the composition 
(group multiplication). Notice that its existence is not automatic. For instance, in 
time evolution of a nonconservative system it generally makes no sense to compose 
the transformations described by the operators U(t,r) and U(r’,s) unless r=r’. 
On the other hand, one can find numerous examples of transformation families which 
have a natural group structure. 


We shall therefore consider a group G of transformations, for the present mo- 
ment without a more detailed specification, and label the corresponding operators 
by the respective group elements, Too =: T(g). The above consistency requirement 
then reads 


T(g')T(g) = w(9’, 9)T(9'9) (10.2) 


with |w(g’,g)| = 1; we have now explicitly indicated the dependence of the phase 
factor on the group elements involved. Since the composition in G is associative, the 
operator T(g"g'g) can be expressed in two different ways; this yields the condition 


w(9", g')w(9"9', g) = w(9', 9)w(9", 9'9) (10.3) 


for any g,g’,g" € G. Because of the presence of the factor w(g’,g) in (10.2), the 
map T(-) may not be a representation of G. The following more general concept is 
introduced: a map T: G — B(H) is called a projective representation of the group 
G ifthereisw: GxG— {zeC: |z| = 1} such that conditions (10.2) and 
(10.3) are satisfied; the function w is then the multiplier of T. Having introduced 
this notion, we can summarize the above discussion as follows: 


(Q5b) the operators T(g) corresponding to a given group G of transformations 
form a projective representation of it on the state space. 


Let us remark that while in most cases of physical interest the multipliers are trivial, 
i.e., they can be included into the operators T(g) so that we have a true represen- 
tation of G’, there are situations where the phase factor not only cannot be avoided 
but also has physical consequences —- see Remark 10.3.2 below. 

With these prerequisites, we are now prepared to discuss symmetries, i.e., trans- 
formations which do not affect observed properties of the system. Suppose that our 
two observers O and O’, whose descriptions are related by the operator T'(g) , 
measure an observable A, 1.e., they perform the set of yes-no experiments corre- 
sponding to the projections E4(A) for all A ¢ B. They end up with the collections 
of probabilities w(A, A;W) and w(A, A;W). The observable is said to be 
preserved by the given group of transformations if the probabilities are the same, 


w(A, A; WO) = w(A, A; WO) 


for any Borel A € JR and all states of the system. Furthermore, we say that the 
system is invariant with respect to a group of transformations G (alternatively, 
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that it exhibits a symmetry with respect to G or that G is a symmetry group of the 
system) if its total energy is preserved by the group. 

Consider first the situation when the operator T(g) is unitary. Postulate 
(Q5a) then yields for any vector y representing an admissible pure state the iden- 
tity (, Ea(A)v) = (b,T (9g) Ea(A)T(g)W) ; mimicking the argument of Proposi- 
tion 7.5.1, we find for any A € B 


Ea(A) = T(g)7'Ea(A)T(g), (10.4) 


so by the spectral theorem the observable A commutes with T(g) (see also Prob- 
lem 2). In the antiunitary case we have 


w(A, A; YO) = (T(g)p, Ea(A)T(9)b) = (T(g)-*Ea(A)T(Q),¥), 


and since the operators E4(A) are Hermitean, the right side has to be equal to 
(E4(A)v,~) for any ~ belonging to some coherent subspace; hence we arrive at 
the same conclusion (Problem 3c). On the other hand, the commutativity of A and 
T(g) easily implies w(A, A; W) = w(A, A; W) for all A € B and admissible 
W) | so we get the following assertion (see also the notes). 


10.1.4 Proposition: Suppose that G is a group of transformations of a quantum 
system and T(-) is the corresponding representation of G on the state space; then 


(a) an observable A is preserved by G iff the operator A commutes with T(g) 
for all g € G. In particular, the system is invariant under G iff this is true 
for its Hamiltonian, 


(b) if A is preserved by G, the restriction T(-)} F4(A)H is a representation of 
G for any AC R;; in particular, T(-) is reduced by each eigenspace of A. 


In the present terminology, an observable A of a conservative system is therefore 
an integral of motion if it is preserved by the group of time translations, and any 
such system is naturally invariant with respect to that group. 


10.2 Some examples 


The general framework for treatment of transformations of quantum systems and 
their symmetries constructed up to now covers many different physical situations, 
as Examples 10.1.1 indicate. On a mathematical level, this diversity is manifested 
in two ways: 


(i) different types of transformation (symmetry) groups are involved. Some of them 
are continuous, most frequently Lie groups; others are discrete, often finite. 
With the help of standard group-theory notions, one can classify the transfor- 


mation (symmetry) groups further, for instance, divide them into commutative 
and noncommutative, etc. 
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(ii) the same group G may have different nonequivalent representations, some- 
times even a vast family of them, to which different transformation properties 
of the system correspond. To classify them, one has to find all irreducible 
representations of G and select those, which fit the description of the sys- 
tem under consideration in view of their dimension, spectra of the involved 
operators, etc. 


Rather than attempting a systematic overview, we are going to discuss here several 
typical and physically important situations. Let us begin with the simplest example 
of the continuous transformation group, which describes the case when the observers 
are mutually shifted. 


Examples 10.2.1: (a) Translations on the line. Consider first the spinless particle 
of Example 7.2.1 and suppose that the observers parametrize the line by shifted 
coordinates x and z’ := x+a for a fixed a € IR. Since they are investigating 
the same particle, it is natural to assume that they will describe its state 
by square integrable functions 7% and w’, respectively, which are related by 
wy’ (a’) = o(z) , or equivalently 


(T(a)p)(x) = Y'(x) = Ya —a) 


for any x € R. The transformation operators T(a) are unitary and form a 
continuous group if the parameter a runs through the reals; we know from 
Example 5.5.1b that {T(-a) : a € R} is generated by the momentum 
operator P (in the standard system of units, by h7'P). 


In view of Stone’s theorem and the definition of commutativity, any function 
of the momentum is preserved at translations; in particular, this concerns the 
kinetic energy of the particle. On the other hand, mimicking the argument of 
Example 6.7.2e we find that a function of position Q does not commute with 
P unless it is trivial, i.e., a multiple of the unit operator. Let the particle be 
described by the Hamiltonian 


Tae ; 
Boe + V(Q); 


as earlier we suppose that the right side is self-adjoint (or at least e.s.a., so 
we can replace it by the closure of the operator; we postpone discussion of 
the conditions under which this is true to Section 14.1). We see that it is 
necessary and sufficient for the invariance with respect to translations that 
the potential V is constant, which means essentially that the particle is free, 
because the choice of the origin on the energy scale is a matter of convention 
in nonrelativistic quantum mechanics. 


Let us stress, however, that this conclusion concerns the whole group T; of 
translations of the real line; it does not exclude the possibility of invariance 
with respect to a subgroup of it. Consider, e.g., the situation where the func- 
tion V is periodic, V(x) = V(a+nb) for any integer n, x € IR, and some 
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b > 0. Then V(Q) obviously commutes with the operators T(nb), which 
form a subgroup in 7, that is naturally isomorphic to the additive group Z 
of integers, and the same is true for the Hamiltonian H. This yields at the 
same time an example of a symmetry group which is discrete but infinite. 


(b) n-dimensional translations. The previous considerations easily extend to sys- 
tems with n configuration-space degrees of freedom. The translation group 
T, of IR” is an n-parameter commutative Lie group, which is represented on 
the state space L?(JR") by 


(T(a)p)(z) = p(x - a) (10.5) 


for any a := (a@1,...,@n) € JR”. Due to Example 8.2.3 the representation 
is unitary and the one-parameter subgroups {T(—ae™)) : a € R} with 
(e*)), := 6j, are generated by the momentum component operators P,, k = 
1,...,n. Since {P,,..., P,} is a CSCO, the only observables preserved by the 
translation group are functions of the momentum; in particular, a system of 
spinless particles described by the Hamiltonian (9.2) is translation—invariant 
iff it is free, i.e., the potential is a constant function. As above, however, an 
interacting system can be invariant with respect to a subgroup of T,, provided 
the potential exhibits such a symmetry; in a multidimensional case the number 
of different subgroups is much larger. They can be continuous, discrete, or 
both. For instance, if n = 2 and V(zi,22) = f(z2) where f: R— R is 
periodic, f(x) = f(z+n), then the corresponding two-dimensional particle 
interacting with this potential is invariant with respect to the group 7, x Z. 


Before we proceed further, it is worth mentioning that there are important 
groups of transformations which can never play the role of a nontrivial symmetry. 
This happens, for instance, if we extend the group of the previous example to include 
all phase-space translations. 


Example 10.2.2 (Heisenberg-Weyl group): The family of Wey] operators intro- 
duced in Section 8.2 forms a projective representation of the translation group Top 
(Problem 4a). Alternatively, these operators give rise to a true representation of the 
Heisenberg-Weyl group defined as the (2n+1)—parameter set G = { g(s,t,u): s,t € 
IR”, u € IR} with the binary operation 


1 
g(s,t,u)g(s',t',u’) := g (sts't+tutu + a(t s'-s-t)] 


It is a (2n+1)-dimensional noncommutative Lie group. The corresponding Lie 
algebra (dubbed the Heisenberg-Weyl algebra) is spanned by the elements GDienC , 
which satisfy the relations 


[pes D5] = (qk; 95] = (pr, c| Be (ax, ¢] =0, [Pk qj] = —1654C 


for 7,k = 1,...,n, ; notice that in distinction to canonical commutation relations it 
does have a finite-dimensional representation (Problem 4c). 
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In this way the Schrédinger representation of the CCR yields representations of 
two closely related transformation groups which are useful for many purposes; just 
recall the role Weyl operators play in the definition of coherent states. Nevertheless, 
the set {W(s,t) : s,t € JR"} is irreducible in L*(JR") by Problem 8.14c, so an 
observable is preserved by the corresponding transformations only if it is trivial, 7-e., 
a scalar operator. 


10.2.3 Examples: (a) Rotations in the plane. Suppose that the observers use 
coordinate frames which are rotated mutually by an angle 9, i.e., they describe 
a point in the plane by the coordinates [x1,22] and [x4,2], where x) := 
Z cos8 + x2 sin, x, := x, sin? — x2 cos9. The most natural choice for the 
transformation operator is then 


(T(8)w) (21,22) :-= U(21 cos O — xe sinO, x; sin6 + x2 cos8). 


By Problem 5.51, the family {T(@) : 6 € R} forms a continuous unitary 
group, which, in addition, satisfies the relation T(0) = T(6 + 27n) for any 
n € Z ; its generator is the angular momentum operator Ls := Qi P2 — QoP,, 
i.e., T(@) = e®43.. According to Example B.3.3a, the operators T(@) form a 
representation of the group SO(2) =7,/Z. 


(b) Spatial rotations. In contrast to Example 1b, an extension to the case of 
rotations in JR*® (as well as to higher dimensions) is not straightforward, 
because the corresponding group SO(3) is not commutative (Problem 5b). 
On the other hand, any rotation can be composed of rotations around the 
axes (Problem 5c) to which the results of the previous example apply; the 
operators representing the rotations can therefore be expressed, e.g., as 


To(a ? B ) 7) = gue | ’ 


where 
by = oem QePh, F=1,2,3, (10.6) 
kl 


with €;,; being the Levi-Civita symbol, are the generators of the correspond- 
ing one-parameter subgroups and a,(,y are the parameters of the rotation 
(usually called Euler’s angles). 


The vector-valued observable L = (Ly, L2,L3) is called the angular momen- 
tum; using the relation (7.5) we can write explicitly, for instance, the action 
of the operators L; on elements of S(JR*). It is also easy to see that the ope- 
rators L; are e.s.a. on S(R*) and L;S(UR*) C S(UIR®) (cf. Problem 5.51); 
the last mentioned inclusion means that S(JR°) belongs to the domain of the 
operator L? := }>*_, L*. In Section 11.5 below we shall demonstrate, using 
spherical coordinates, that L? is unitarily equivalent to I @ A, where the 
operator A is given by (11.17); due to Proposition 11.5.2 it has a pure point 
spectrum, o(A) = {1(/+1): 1/=0,1,...}. 
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It is straightforward to check that the angular momentum components satisfy 
on S(JR?) the commutation relations 


[L;, Ly] = tejaLi (10.7) 


(with the summation convention), which means that they form a representa- 
tion of the Lie algebra so(3). Hence the relation between the rotation group 
and its Lie algebra gives rise to a relation between their representations. Fur- 
thermore, both of them are reduced by eigenspaces of the operator L? ; this 
fact can be used to simplify the treatment of systems, which are invariant with 
respect to rotations — see Section 11.5 below. 


Particles with a nonzere spin have multicomponent wave functions, and therefore 
operators representing rotations in this case have a “matrix” part as well. 


Example 10.2.4 (spin rotations): As a kind of angular momentum, spin has again 
to satisfy the relations (10.7). We know that it is indeed the case; we have con- 
structed the corresponding (2s+1)—dimensional irreducible representations (7.11) 
in Example 7.5.4b. It is therefore natural to associate with rotations operators in 
the spin state space €2°+! defined by 


aaa: B, 7) s eomnernnaue c 


They form a unitary representation of SO(3) asin the preceding example. However, 
there is an important difference. To illustrate this, consider the rotation of the 
angle 27 around the third axis. In view of (7.11) and functional-calculus rules, 
T(2x,0,0) = (—1)?°1, which differs from the unit operator if the spin s is half- 
integer; this conclusion extends to the rotation of 27 around any axis, because the 
corresponding operator is unitarily equivalent to T(27,0,0). 

Hence if we observe a particle of a half-integer spin, for instance, an electron, 
then the coordinate system must be rotated twice, i.e., by 47, to arrive at the same 
spin state. This property manifests that the operators T,(a,@,y) in fact form a 
representation of the simply connected group SU (2) , which is related to the rotation 
group by SO(3) = SU(2)/Z_ — cf. Example B.3.3b and Problem 6. 

If we return to the full state space L?(JR?; €?%+!) of a particle with spin s, the 
spin coordinate operators are, due to Example 7.5.4b, replaced by S; := I, @ Sj. 
On the other hand, the “configuration—-space part” of the angular momentum (usu- 
ally called orbital in this situation) is L,; := L; ® I, ; it can be expressed in the form 
(10.6) by means of the corresponding position and momentum operators (Prob- 
lem 7a). The total angular momentum of the particle, whose components are repre- 
sented by the operators J; := L; + 5,;, then generates a representation T(-) of the 
rotation group in the same way its orbital and spin parts do; moreover, 


T(g9) = T.(g) ® T;(g) 


holds for each element g = (a,(,) of the group. The representation T(-) is, in 
general, reducible; we return to this problem in Section 11.5. 
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The rotation group SO(3) considered in the above examples is a subgroup in the 
group O(3) of all transformations of the configuration space that preserve the vector 
length. The last named group has two connected components because det R = +1 
holds for any orthogonal matrix R ; due to this fact, any element of O(3) can be 
composed of a rotation and a discrete mirror transformation. 


Example 10.2.5 (space reflection): The transformations connecting observers, who 
use the mirrored coordinate systems, form the simplest nontrivial group consisting 
of two elements. Its representation on the space L?(JR*) can be constructed by 
means of the reflection operator R of Example 3.3.2: the wave functions are related 
by P:=7R, 1.e., (Pw)(x) = nw(—2x), where 7 is a phase factor. Since the group 
property requires P? = J, there are just two nonequivalent representations of this 
type corresponding to 7 = +1. It is easy to see that the position and momentum 
operators anticommute with P, i.e., 


PQ;P=-Q;, PP,P=-P,;, j=1,2,3; (10.8) 


this in turn implies that the angular momentum is preserved by the space reflection, 
PL;P = L;. If the particle has a nonzero spin, in agreement with experience we 
postulate the same relation for the spin-component operators, so the reflection is 
represented in this case by P:= P@J, and PS;P = Sj; ; the analogous relation 
is then also valid for the total angular momentum. 

The operator P, however, is Hermitean at the same time, so it represents an 
observable. It is called the parity; it has a dichotomic character since the eigenvalues 
of P are +n (cf. Problem 8). It should not be confused with the internal parity, 
which is the number 7 specifying the type of reflection-group representation. The 
latter is also an observable, but of the type mentioned in Example 7.5.4b: with each 
of the known elementary particles we can associate just one value of the parameter 
n. The same is true for composite systems which we shall discuss in the next 
chapter: the parity operator is then of the form 7R ® lint, where R means the 
space reflection on the configuration—-space part of the state space, Jing is the unit 
operator corresponding to the internal degrees of freedom and internal parity 7 is 
the product of the internal parities of the constituents. 

It is clear from (10.8) that simple systems such as a free particle are invariant 
under space reflections, so by functional—calculus rules, the parity is an integral of 
motion. In general, this is not true for operators (9.2) unless the potential V is an 
even function. This is true, fortunately, for isolated nonrelativistic many-—particle 
systems, where the interaction Hamiltonian typically depends on the distances of 
particles only. On the other hand, there are isolated systems which are not invariant 
with respect to the mirror transformation. A famous example is the radioactive 
nucleus Co®, which emits electrons with a smaller probability in the direction of 
the nucleus spin than in the opposite direction (see the notes). 


10.2.6 Remark: We have mentioned already that elementary particles are grouped 
into charge multiplets such as a nucleon, a pion, etc. The corresponding Hilbert spa- 
ces are finite-dimensional and support representations of SU(2) and of the reflection 
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group analogous to those discussed in the preceding two examples. In particular, 
the operators I;, 7 = 1,2,3, that generate the represention of SU(2) are called 
(components of) the isotopic spin (or isospin). They can again be expressed in the 
form (7.11); in a similar way we introduce the isotopic parity. However, there is an 
important physical difference between the two cases. The isospin algebra contains 
the charge operator which equals 


1 
Q=13+ 5(B+S), 


where the numbers B, S' are the baryon number and the strangeness, respectively, 
characterizing the type of particles involved. Since Q defines a superselection rule, 
only some isospin transformations lead to physically admissible states. 


Another important example of a finite group of transformations is obtained if 
we consider permutations in a system of N particles; this gives rise to a symmetry 
group if some of the involved particles are identical. We postpone the discussion of 
this case to Section 11.4. 


Example 10.2.7 (time reversal): As in the previous example, transformations 
connecting observers, who use different directions of time (see Example 10.1.1d) form 
the two-element group. However, the operator T representing the time inversion 
is antiunitary. To justify this choice, first consider a system which is invariant with 
respect to switching of the time direction. By Proposition 10.1.4 its Hamiltonian 
commutes with T , and therefore the corresponding propagator satisfies the relation 
TU(t)T = U(—t) for all t € IR (cf. Problem 3b) as it should be the case (with 
a unitary T, the last relation would require the two operators to anticommute, 
TH = —HT, so the sign of the energy will be changed in the transformed system). 
However, since the operator T should be the same for ail systems with the same state 
space, independently of a particular Hamiltonian, it is also reasonable to assume the 
antiunitarity in the general case. 

In the simplest case of a spinless particle we define the time-reversal operator 
as T := K, where K is the complex conjugation, (Kw)(x) = ~(az). In general, 
one may add a phase factor 7 to the right side (the time parity; in contrast to the 
space reflections the group condition imposes no restriction on its value due to the 
antilinearity of J), but in quantum mechanics, where the numbers of elementary 
particles are preserved, there is no need for it. The basic observables transform as 


TOT =, TRT=-P, POP -y, pee Ge) 


(Problem 9a). For a particle with spin s we put T := K @V, where V isa unitary 
operator on the spin space €?*+!. The relations (10.9) are then obviously valid for 
the observables Q;, P;,and L;. It is natural to assume that the spin component 
operators have the same transformation properties, TS;T = -S;, 7 = 1,2,3. It 


appears that this condition determines the operator V uniquely up to an overall 
phase factor: using the same basis as in (7.11) we have 


= —inm 
Vian! = m,—m! € 
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(Problem 9b). In particular, the time-reversal operator for a particle with spin ; 


is T = K @ 02, where a2 is the Pauli matrix. 


To conclude this brief survey, let us add a few comments and observations. 


10.2.8 Remarks: (a) The representations of the continuous transformation groups 
considered above are typically generated by operators associated with some 
distinguished physical observables: translations are related to momentum com- 
ponents, rotations to the angular momentum, etc. These correspondences have 
the same fundamental importance as the transformation behavior expressed 
by (Q4b). At the same time, important observables can also be related to 
some discrete transformation groups as Example 5 shows; however, there is 
no general rule which would tell us which groups have this property and what 
the meaning of the corresponding observables is. 


(b) Suppose that an observable A is associated with a transformation group G, 
either as a generator of its representation or as a representative of a group 
element itself. If the system is invariant with respect to G, then its Hamil- 
tonian H commutes with the operators T(g), and therefore also with A, so 
the latter is an integral of motion. This correspondence between symmetries 
of a system and the existence of conserved quantities is sometimes referred to 
as Noether’s theorem; it is valid not only in quantum theory. 


(c) The symmetries considered in the above examples have a rather straightforward 
interpretation. It may happen, however, that the system is invariant under an 
additional group which is less obvious (see, e.g., Problem 10 and the notes); 
then we speak about a hidden symmetry. 


(d) Transformation properties can be expressed in terms of other groups than dis- 
crete or Lie groups. As an example, consider a particle in a time-independent 
electromagnetic field (Problem 11). The Hamiltonians H(A) and H(A’) as- 
sociated with vector potentials A and A’ := A-—VA are unitarily equivalent 
by means of U, := e**(@). The family of these operators forms an Abelian 
group whose elements are parametrized by functions A, and therefore it is 
infinite-dimensional as a vector space. 


(e) The state space can support two representations of the same group, both of them 
physically meaningful — cf. Example 1b and Problem 12. On the other hand, 
the fact that there is a representation of a group G on the state space, which 
commutes with the Hamiltonian, does not imply that G is a symmetry of the 
corresponding system; the point is that G need not be a transformation group. 
As a simple example, consider a family of linearly independent functions f; : 
R® — R, j =1,...,n, and denote f;(P) := f;(P:, P,P). The operators 
exp (Sof ta; f,(P)) generate a representation of the Abelian group T;, on 
L? (UR?) which commutes with the free-particle Hamiltonian; however, they 
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can be given a reasonable physical interpretation (invariance with respect to 
translations) only if f; are linear functions. 


An inspection of the above examples shows that a distinguishing feature of 
physically relevant space-time transformations is that the corresponding group 
is represented by substitution operators (cf. Example 3.3.2), i.e., that if the 
group relates points x, x’, then the value of the wave function at x is deter- 
mined only by the value of the transformed wavefunction at z’ ; this is why 
such operators are often called local. 


10.3 General space-time transformations 


The translations, rotations, and space reflections considered in the examples of the 
preceding section together with time translations may be included in a wider group 
of space-time transformations. However, this group is not unique; its choice depends 
on whether the system under consideration is nonrelativistic or relativistic. We are 
going now to review briefly properties of the space-time transformation groups in 
these two cases; for simplicity we restrict ourselves to continuous transformations, 
i.e., we shall not speak about space and time reflections. 

Consider first the nonrelativistic case, where the general continuous transforma- 
tion between two inertial frames is given by 


v= Re+vt+a, t=t+b, (10.10) 


where R € SO(3) corresponds to rotations, vectors a, v € JR® to translations and 
boosts, respectively, and b € JR is the time shift. Together these transformations 
form a ten—parameter group whose elements g := (b,a,v,) satisfy the following 
composition law 


jg = (b+, Rata'+v'b, Rut’, RR); (10.11) 


we call it the Galilei group and denote it as G. We also introduce the Euclidean 
group € as the six-parameter subgroup of G including the translations and rota- 
tions. Using the results of Examples 1, 2, and 4 of the previous section we see that 


for a particle with spin s the group € is represented by the operators T(R,a) on 
I? UR’; C24+1) defined by 


(T(R,a)b)m(2) = 32 Smpa(R)Y(R-'Ma—a)), 


m=—8 


where S(R) is the matrix representing the spin-rotation operator T,(a,,) for 
R = R(a,@,y). By Example 10.2.1b and Proposition 5.7.3b, the free-particle 
Hamiltonian H := = P® is invariant under translations and spin rotations; in 
combination with the results of Section 11.5 below this shows that it is invariant 
with respect to the whole group €. 
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10.3.1 Proposition: Wave functions of a free particle of mass m and spin s 
transform under the Galilei group by the operators 


CG) = 2 TR a vb) ee (10.12) 


which form a projective (unitary, strongly continuous) representation of G with the 
multiplier w(g’,g) := exp (im(v' -Rla-a’-Ru—Rv- v'b)) : 
Proof: As a product of unitary operators, U(g) is unitary, and in a similar way we 


obtain the strong continuity of the map U(-). Finally, the relation U(g')U(g) = 
w(g’,g)U(g'g) is checked by a straightforward computation (Problem 13). ff 


-Note the sign in the exponent of e*#® in (10.12), which is due to the passive 
approach we use. In the active approach we obtain by (10.1) the standard evolution 
operator which shifts the system on 6 in time. 


10.3.2 Remark: There is an important consequence of the fact that Galilei-group 
transformations are realized on the state space by means of a projective representa- 
tion. The reason is that one can find elements ga, gy € G such that the composition 
9592 '9vga equals the unit element but the product of the corresponding operators 
differs from J by a nontrivial phase factor (Problem 15). The same is true for 
representations describing transformation properties of many—particle systems. In 
Section 11.5 below we shall see that these are given by U(g) := Covey U)(g) , where 
U‘9)(.) are representations referring to the particles of which the system consists; the 
phase factor of Problem 15 is in this case replaced by e*@*” , where M := Pe, m; 
is the total mass of the system. 

Suppose now that a state of such a system is a nontrivial superposition of states 
Vin, Um with masses m and m’, respectively, i.e., that it is described by am + 
Bum for some Wm, Wm € H and nonzero a, 3. The argument sketched above then 
shows that ae”? qm +6 em has to describe the same state for all a,v € JR°; 
however, this is impossible unless m = m’. It follows that in nonrelativistic quantum 
theory states with different masses belong to different coherent subspaces; this fact 
is usually referred to as the Bargmann superselection rule. 


Let us now pass to the relativistic case. Following the usual convention, we 
denote the space-time points as x = (Zo, 2) ; the components will be numbered by 
the Greek indices p,v = 0,1,2,3 while the Latin indices, j,k = 1,2,3, are reserved 
for space coordinates. Space-time transformations are now described by another 
ten—parameter group defined by 


z= Azr+a, (10.13) 


which is called the Poincaré group and is denoted as P. Here the a are four- 
dimensional vectors representing space-time translations, and the A are real matri- 
ces, which preserve the indefinite bilinear form (x, y) + ry := royo — ZY. The cor- 
responding six-parameter subgroup C is called the Lorentz group; since the form 
can be expressed through zy = z*y, = gwa'y” with gw := diag(1,—1,-1,-1), 
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the connected component of the unit element in CL, which is characterized by the 
conditions det A = 1 and Ago > 0 (see the notes), is also denoted as SO(3, 1). 
The groups P and G are, of course, different but the Euclidean group € is their 
common subgroup. 

The Poincaré group has many representations. Let us describe a class of them 
which is physically the most important. The easiest way to do this is to express 
the four-momentum components as multiplication operators. Given m > 0, we 
take Hn := {p € IR‘: po > 0, p? =p —p* =m’ }, i.e., the “mass shell” in the 
corresponding Minkowski space. It is homeomorphic to IR? by means of the map 
jm : p++ p; this makes it possible to equip Hm with a topology and to define on 
it the Borel measure wy, by 


dp 
on Ian TPE 


For any s = 0, 4,1, 3,... we denote Hms = L?(Hm,dwm;C**") ; then the 
following assertion is valid (see the notes). 


10.3.3 Proposition: For any m> 0 and s= 0, $s 1, 3, ..., there is just one irre- 
ducible (unitary, strongly continuous) representation of the (proper orthochronous) 


Poincaré group on Hy, which is of the form 


(U(A, a)h)(p) = e'(S(A)p)(A*p), (10.14) 
where S(-) is an irreducible representation on the Lorentz group on ©?+!. 


10.3.4 Remark: The explicit form of the representation S is known but we shall 
not need it; we refer to the literature quoted in the notes. In particular, for the 
subgroup of rotations in £ it reduces to the operators of spin rotations discussed in 
Example 10.2.4; this means again that a fermion wave function changes sign when 
rotated on the angle 27. 


Now we want to know how the wave functions of relativistic systems transform 
under the Poincaré group. Due to the postulate (Q5b), the state Hilbert space of 
such a system supports a representation of P. However, if we have in mind a par- 
ticular system, for instance, an elementary particle, our situation is substantionally 
different from the nonrelativistic case. There we have first chosen the state Hilbert 
spaces and operators representing observables for such simple systems, and then we 
looked for the ways in which they transform. Inspired by the foregoing considera- 


tions, we are not going to repeat the reasoning. Instead, we reverse the argument 
and postulate that 


(Q5c) the state Hilbert space H of a relativistic elementary particle of mass m 
and spin s supports a representation of the Poincaré group which is unitarily 
equivalent to (10.14). The operators on H representing the energy, momen- 
tum, and angular momentum coincide with the generators of the subgroups of 
time translations, space translations, and space rotations, respectively, in P. 
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Notes to Chapter 10 


Section 10.1 In fact, there are theories in which a map analogous to Too: may not 
exist; recall again general relativity, where there are space-time manifolds (Schwarzschild 
horizons), which prevent any possible communication between the observers who had the 
bad luck to exist on different sides of the barrier. In the existing quantum theory, however, 
this cannot happen. The argument showing that Too is represented by a unitary or 
antiunitary operator should be regarded as the analogous considerations of Secs.7.1 and 
9.1; our aim is to motivate a functioning scheme, which would relate the results of a pair 
of observers, and not to prove that it is the only possible. 

The assumption that the observers use the same operator to describe a particular 
observable resembles the Schrédinger picture of Section 9.2. An alternative is to keep 
the states fixed and to suppose that when the first observer describes an observable by 
an operator A, the other uses the operator T(g)~!AT(g) for the same observable. The 
conclusions of Section 9.2 extend to the present situation; the Heisenberg—type formulation 
is usually preferred in situations where a large number of observables is involved, as in 
quantum field theory. Needless to say, it is also possible to divide the transformation— 
induced change between the states and observables whenever it is suitable. 

Reduction of representations of symmetry groups is an efficient tool for investigation of 
quantum systems. Suppose that an observable A is preserved by a transformation group 
G. If the corresponding representation T(-) is reduced by a projection E, , Proposition 4 
imposes additional restrictions on the operator A. For simplicity, we denote Ep := I—E}, 
’ and furthermore, T;(g) := T(g)}Ran£;, Aj, := E;AE, ; then in view of the relation 
T(g)A Cc AT(g) not only T;(g) commutes with A;;, but also the “off-diagonal” parts 
have to satisfy the intertwining relations T;(g)Ajxz C AjxTx(g) for all g EG. 


Section 10.2 Basic facts about Lie groups and algebras are summarized in Appendix B.3; 
for further reading we recommend, e.g., [BaR], [FG], [Ham], [Kir], [Mac 2], [Nai 3] 
and [ Wig]. In these books the reader can also find more information about finite groups, 
which appear here in a few simple examples only, as well as about other applications of 
the theory of groups and their representations to description of symmetries of quantum 
as well as classical systems. 

More details about the HW-group and HW-algebra can be found, e.g., in [ Wol 1]. 
By Problem 4a, any (unitary, strongly continuous) representation of the Weyl relations 
generates a projective representation of the translation group To, with w(s,t;s’,t’) = 
4(t -s'—s-t’). Properties of this multiplier can be used to formulate an alternative proof 
of Theorem 8.2.4 — see [Si 1], Sec.7. Angular momentum, introduced in Example 3, is one 
of the fundamental observables. We shall return to it in Section 11.5; more information 
can be found, e.g., in [Ja], Sec.13-3; [Thi 4], Sec.3.2; or in special monographs such as 
[Edm], [BL]. 

The operator R is sometimes also called parity if the value of 7 is not important. The 
parity violation in weak interactions responsible for G—decays of atomic nuclei mentioned in 
Example 5 was predicted by T.D. Lee and C.N. Yang in 1956 and confirmed experimentally 
by C.S. Wu — see, e.g., [Schwe], Sec.10f; [FG], Chap.3. The essence of the effect is that 
the momentum component operators change sign in the mirrored system while the spin 
components do not; hence the assumption of space-reflection invariance would require the 
momentum distribution of the emitted electron to be symmetric with respect to the plane 
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perpendicular to the nucleus spin. 

The isospin group SU(2) extends to more general transformations between internal 
degrees of freedom of elementary particles. In the early sixties M. Gell-Mann and G. Zweig 
embedded it into the group SU(3), whose lowest-dimensional representations were con- 
nected with a triplet of quarks considered as building blocks of elementary particles; this 
idea led to the currently accepted theory of strong interactions called quantum chromody- 
namics. A decade later the discovery of J/Y and Y particles demonstrated the existence 
of two more quarks. In the meantime S. Weinberg and A. Salam constructed a unified the- 
ory of weak and electromagnetic interactions by combining the isospin SU(2) group with 
the Abelian group U(1) of gauge transformations (the latter should not be confused with 
the infinite-dimensional group of Remark 8d, which acts in the configuration space). This 
so-called standard model represents one of the most powerful applications of symmetry 
ideas in quantum physics; the reader can find more details, e.g., in [Hua 2], [CL]. How- 
ever, some ingredients of the standard model have still to be confirmed experimentally, in 
particular, the existence of the sixth quark. 

If the Hamiltonian of a fermion is time-reversal invariant, then by Problem 9c no 
eigenvalue of it can be simple. The same is true for any system containing an odd number 
of fermions; we speak about Kramers’ degeneracy. For most quantum—mechanical systems 
the symmetry with respect to the time reflection is checked easily — see, e.g., Problem 9d. 
On the other hand, a violation of T-invariance (the latter is equivalent to the so-called 
CP-invariance, i.e., the space reflection accompanied by changing signs of the particle 
charges — cf. [SW]) have been observed for weakly interacting elementary particles such 
as neutral kaons mentioned in Example 9.6.4. 

A well-known hidden symmetry exists in the hydrogen—atom Hamiltonian — see, e.g., 
[BaR], Sec.12.1. Hidden symmetries are usually manifested by the fact that eigenspaces of 
the Hamiltonian which, due to Proposition 10.1.4b, support a representation of the appar- 
ent symmetry group, have in fact a larger dimension than corresponds to an appropriate 
irreducible representation; we also speak about an accidental degeneracy. 


Section 10.3 A thorough discussion of properties of the Galilei group and its represen- 
tations can be found in the review {LeL 1]. The reader should pay attention to the fact 
that in general free-particle observables are not preserved by the operators representing 
elements of G in the sense of the definition given in Sec.10.1 (cf. Problem 14). Instead, 
they transform naturally, 2.e., in the same way as the corresponding classical observables. 
We say that the quantum mechanics of a free particle — and other theories with this 
property — are covariant with respect to G. However, with an abuse of terminology one 
sometimes speaks about a G-invariant theory having in mind its covariance with respect 
to the transformation group G. 

The fact that the mass represents a superselection rule in nonrelativistic quantum 
mechanics was first noticed by V. Bargmann — cf. [Ba 3], and also [LeL 1]; [ BaR J, 
Sec.13.4. We mentioned it in the notes to Section 7.4 as an example of a “continuous” 
superselection rule; however, one has to take into account that practically every quantum 
system consists of particles with a finite number of different masses, so the rule is in fact 
“discrete” after all. While in most quantum mechanical problems mass conservation is 
self-evident, sometimes this is not the case; for instance, in nonrelativistic decay models; 
then the Bargmann rule has to kept in mind — see, e.g., {DE 2], Part I. 


There are textbooks devoted solely to the special theory of relativity — see, e.g., 
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[ Vot ]; on the other hand, information about the Lorentz and Poincaré groups can be 
derived from some books on group theory such as [ BaR.], Chaps.17 and 21, and almost 
every book on quantum field theory — cf. the notes to Sec.7.1. 

As we have mentioned above, the “full” Lorentz group also contains the discrete 
transformations of space and time reflection. The connected component SO(3,1) of the 
unit element which we consider here consists of A € £ with detA = 1 and Ago > 0 
(compare to Example B.3.2b). It is called the proper orthochronous Lorentz group and is 
denoted as CL. ; the same conventions apply to the Poincaré group. However, if we are not 
interested in the reflections, we usually drop the adjectives, subscripts, and superscripts. 

Some Lie algebras are related by a limiting procedure called contraction. In this way, 
in particular, the Poincaré group passes to Galilei when the velocity of light (which, for 
simplicity, we have put equal to one here) approaches infinity — see [BaR ], Sec.1.8. 

An explicit expression of the representation S(-) appearing in (10.14) can be found 
in many places — see, e.g., | SW], Secs.1-3; [ BLT ], Sec.2.3. To construct these and 
other representations of P, one usually replaces SO(3,1) by a simply connected group 
in analogy with Example 10.2.4; this time it is the group SL(2,C). It appears, however, 
that the sought representations of P are determined uniquely by a representation of the 
subgroup SU(2) Cc SL(2, @) ; the remaining operators are obtained by means of the group 
composition law. This is the so-called induced-representation method which is described 
and used for classification of irreducible representations for a wide family of groups, e.g., 
in | BaR], Chap.17. The classification for the Poincaré group appeared for the first time 
in | Wig 1]. Let us also remark that irreducible representations of the full Poincaré group 
including reflections are classified by one more discrete index (parity). 

Representations of P associated with relativistic systems other than stable elementary 
particles are no longer irreducible. For systems consisting of a finite number of elementary 
particles such representations are easily constructed by tensor product — cf. Section 11.5 
below — and in a similar way one can proceed for more complicated systems in second— 
quantization formalism. On the other hand, representations associated with unstable 
particles are typically direct integrals of the representations (10.14) — see [BN 1], [Ex 3] 
or [Ex], Sec.3.5, and also Problem 16. 


Problems 


1. Justify the relation (10.1). 


2. Using (10.2), we can write the relation (10.4) alternatively in the form E,4(A) = 
w(g-4,g)~!T(g-!)Ea(A)T(g) . Is it possible in general to dispose of the multiplier? 


3. Let A bea linear operator, in general unbounded. An antilinear bounded operator 
T is said to commute with A if TAC AT. Prove: 
(a) if A is self-adjoint, an assertion analogous to Theorem 5.3.1b is valid, 


(b) the operator T commutes with A iff Te'4t = e4'7 for all t € IR, and 
this is in turn equivalent to the relation T R4(z) = Ra(Z)T for all z & p(A), 

(c) if, in addition, T is antiunitary, then A,T commute iff A = T-'AT = 
RAR 


376 CHAPTER 10. SYMMETRIES OF QUANTUM SYSTEMS 


4. Prove: (a) The operators (8.9) or (8.10) form a projective representation of the 
translation group Ton. 


(b) The Heisenberg—Wey] group of Example 10.2.2 satisfies the group axioms with 
e:=9(0,0,0) and 9(s,t,u)~! := g(—s, -t,—u). 


(c) The HW-group can be represented by the matrices 
1 8+it 8+it 


G(s,t,u) = | —stit 14+2iu— 4(s?+#?)  2iu— $(s?+ t?) 
s—it —2iu+ 3(s?4+t?) 1-2iut 3(s?+1?) 


The matrices Q;, Px, C defined by Q; := —i(0G/08;)g=e , etc., form a repre- 
sentation of the HW-algebra. 
(d) Why does this fact not violate Theorem 8.2.4 ? 


Hint: (a) Use Problem 8.15. 


5. Prove: (a) The relation xz’ = Rx defines a bijective correspondence between rotations 
of the Euclidean space JR° and elements of the group SO(3), i.e, 3 x 3 
orthogonal matrices R, R'R = I, with det R = 1. Extend this result to 
rotations of JR”. 


(b) Find the subgroups of SO(3) corresponding to rotations around the axes. 
Let R;(@) be the rotation around the j-th axis on the angle @ ; compute 
R;(6)Rx(@’) for 7 #k. 

(c) Let R(a, 8,7) := R3(a)R2(G)R3(y). Check that any rotation is of this form 
for some a, 3,7. 


6. Prove: (a) Let U € SU(2) and € € R?° ; then there is a matrix R € SO(3) 
such that U€-0U-! = D5k RikEjok, Where o, are the Pauli matrices and 


€-0 = Veber. 


(b) The map SU(2) — SO(3) defined in this way is a homomorphism which is 
not injective; on the other hand, {U,-U} + R is a bijection. 


7. Using the notation of Example 10.2.4b, prove 


(a) the orbital-momentum components equal L; = 2, €jnQ,P;, where Q,, P, 
are the operators of Example 7.5.4b, 


(b) the total angular-momentum operators J; := L;,+8; generate a unitary 
representation of the rotation group on L?(JR3, ©28+!) by 


T (ax, 6,7) := befits 
and T(g) = To(g) ®Ts(g) holds for any g = (a,{,7). 
8. Let P:=7R be the parity operator on L?(JR°). Prove that 


(a) P is Hermitean; write the corresponding spectral decomposition, 
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(b) P is reduced by the eigenspaces of the operator ‘Spe L; ; find its parts in 
these subspaces, 


(c) extend the conclusions to the parity operator on L?(IR?) @ ©?¢+!, 
Hint: (b) Use Proposition 11.5.2. 
9. Consider the time-reversal operator T of Example 10.2.7. 


(a) Prove the relations (10.9). 
(b) If the spin component operators (7.11) satisfy the conditions TS$;T = S,, 
j =1,2,3, with T:=— K @V, then Vinm = dm—m! elo-™m) for some a € R. 


(c) J? = (—1)*%I where s is the spin of the particle. It follows that if s is 
half-integer and the Hamiltonian is time-reversal invariant, any eigenvalue of 
it has at least multiplicity two. 


(d) If the operator (9.2) with a real-valued potential V is e.s.a., then the corre- 
sponding Hamiltonian is time-reversal invariant. 


Hint: (b) The assumption implies VS;V—! = —S;, where 9; is the complex con- 
jugated matrix of S;. (c) Apply T? to an eigenvector. 


10. Let H be the Hamiltonian of the three-dimensional harmonic oscillator on L?(JR°) , 
i.e., H:= P?+Q? with P? := 3_, P? and Q? := 3_., Q?. Define the operators 
Tj = fax + 453~, where aj := 2~1/?(Q;+4P;), and prove 


he — Dies Tj; and [H,Tjx)y =0 for any  € SUR) and j,k =1,2,3, 
(b) the operators T;, form a representation of the Lie algebra u(3), i.e., they 
satisfy on S(JR°) the relations 
[Tij, Tre] aa jk Lia — ST jx i,j,k,l=1,2,3, 


and T;, :=T;, — 46j,H form a representation of su(3) , 


(c) the Lie algebra of the rotation group is the subalgebra in su(3) whose repre- 
sentation is generated by Li +iL2 := Ti2, its adjoint, and L3 := Ty, — Tog. 


11. Suppose that the functions V, A, satisfy the assumptions of Theorem 14.1.8 and 
A: IR? | RR is a function whose first and second partial derivatives are continuous 
and bounded. Let H(A) := 54 (P—- eA(Q))*?+ V(Q), and in a similar way H(A’) 
corresponds to the gauge-transformed vector potential, Aj, := A, —0,A. Prove 
that H(A’) =U,HU;x?, where Uj := e@4(9), 


12. Let Ga:={xrre’x : s€ R} be the dilation group of the space JR”. 


(a) Check that the operators Ug(s) : (Ua(s)~)(x) = e”8/2y)(e8”) form a contin- 
uous unitary representation of Gg on L?(UR"), and find the corresponding 
representation of the Lie algebra of Gq. 


(b) How do the fundamental observables transform with respect to Gq ? 


378 


13. 
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16. 


CHAPTER 10. SYMMETRIES OF QUANTUM SYSTEMS 


(c) Show that in the case n = 3 the representation is reduced by the eigenspaces 
of the operator L?, and find the parts of Ug(s) in these subspaces. Extend 
these conclusions to a general n > 2. 


Hint: (a) Cf. Example 5.9.5b. (c) Use Remark 11.5.4. 
With the notation of Theorem 10.3.1, prove 


(a) (ciety et) (x) = eimu-(v(b+0')+22—a)/2y),( + vt/) holds for 
all géG, be R, and » € L7(R°, C**+), 

(b) use this result to check that the relation U(g’)U(g) = w(g’,g)U(g’g) is valid 
for any g,g' €G, where w(g’,g) := exp (§m(v' -Rla—a’'-R'v— R'v- v'b)) , 


(c) check that the condition (10.3) is valid for the multiplier w. 
Hint: (a) Use (7.4) and Example 5.5.1b. 


Find the transformation properties of the free-particle Hamiltonian H := xP? un- 
der Galilean boost, i.e., the operators H(v) := U,HU;! where U, :=U(0.0,v,/). 
How do the component operators of position, momentum, angular momentum and 
spin transform under G ? 


Consider the Galilei-group transformations gg := (0,a,0,J) and gy :=(0,0,v,J). 
Check that gi go 'guga = e and U(g,')U(gz')U(g.)U (gay = e'™**y for any 
a,v € IR? and y € L?(R?, C+). 


In the setting of Section 9.6, let U(-) be a representation of the Poincaré group 
on H and V : V(A,a) = U(A,a)}H, its restriction to the state space of the 
unstable system (by Proposition 9.6.1, V is not a representation of P ). Show that 
if the unstable system is preserved by Euclidean transformations (in particular, 
the projection E, commutes with the translation operators), then the operators 
V(A, 0) associated with Lorentz boosts cannot be unitary. 

Hint: Use the identity U(I, Aa)U(A~',0)U(I, —a)U(A, 0) = U(I, Aa — A~1a), and 
choose A := A(G 3) as a boost with a velocity 6 and a: = (0,a@) where @ is parallel 
to 6 


Chapter 11 


Composite systems 


Most information about microscopic physical systems comes from investigation of 
their mutual interactions and the ways they manifest themselves in bound states, 
scattering processes, etc. A treatment of such situations requires knowledge of how 
a quantum system composed of a certain number of subsystems can be described. 
This is the topic of the present chapter. 


11.1 States and observables 


Suppose that a quantum system S with the state Hilbert space 1 consists of a finite 
number of distinguishable entities 5),...,5,, which we call subsystems. Examples 
are plentiful; recall the hydrogen atom, consisting of a proton and an electron. We 
could attempt to express in a more formal way what a subsystem should be, but we 
refrain from doing so limiting ourselves to the assumption that each S; is by itself 
a quantum system endowed with its own state Hilbert space ;. The system S 
will then be said to be composed of Sj,...,5n. 

Though a practical reduction of a system to its subsystems cannot continue ad 
infinitum, the requirement of finiteness is certainly restrictive, because it is often 
useful in the theory to consider an infinite number of components. This would lead, 
however, to additional mathematical problems, so for the present moment we accept 
this limitation, and postpone the discussion of the infinite case to the next chapter. 
Moreover, we shall speak mostly about the simplest case of a system consisting of a 
pair of subsystems, because the extension to any finite number is straightforward. 

We have already several times encountered the system of n spinless particles 
whose state Hilbert space is H = L?(JR*”) ; each particle then represents a subsys- 
tem with the state space H; = L?(UR*). Due to Example 2.4.5, H can be identified 
in this case with the tensor product of one-particle state spaces; motivated by this 
fact we adopt the following general assumption: 


(Q6a) the state Hilbert space of the composite system is the tensor product of the 
subsystem state spaces, H = Hi ®-:-@ Hn. 


379 


380 CHAPTER 11. COMPOSITE SYSTEMS 


It is necessary to make a reservation from the beginning: in the present form the 
postulate is suitable only if S consists of mutually different subsystems. A modifi- 
cation for the situation when some of them are identical will be given in Section 11.4 
below. 


11.1.1 Remarks: (a) The state Hilbert space can often be written in a tensor 
product form even if no real physical systems can be associated with the com- 
ponent spaces. The results we shall derive therefore also apply to systems 
composed of such “fictitious” subsystems; recall, for example, the spinless 
particle consisting of three “one-dimensional” particles or the electron com- 
posed of the “configuration-space” and “spin” electron from Examples 7.2.3 
and 7.5.4b. These decompositions are related to the concept of the number of 
degrees of freedom; by this the number of the simplest fictitious systems into 
which a given system can be split in usually understood. 


(b) We mentioned in Section 7.1 that the state Hilbert spaces of quantum mechani- 
cal systems are usually supposed to be separable. Now we can formulate an 
argument in support of this hypothesis. A typical quantum mechanical system 
consists of a finite number of particles, and each of them can have some internal 
degrees of freedom such as spin, isospin, etc., whose number is also finite. The 
total state space is therefore the tensor product of a finite number of Hilbert 
spaces. Among these, those corresponding to the configuration-space degrees 
of freedom are isomorphic to L?(R) by the Stone-von Neumann theorem, and 
are thus separable. On the other hand, spaces associated with known internal 
degrees of freedom are even finite-dimensional, t.e., separable again; in view 
of Proposition 2.4.4b the total state space is then separable. 


Let us now look at the relations between the observables of a composite system 
S and its subsystems S;,S2. We shall use the notation introduced in Section 7.4: 
the symbols © and ©; denote the sets of observables of the respective systems, 
and in a similar way we shall denote the sets of bounded observables and algebras 
of observables. Consider first the observables which are related to one system only, 
say, to S,. With any such observable A we associate a self-adjoint operator A; on 
H, , and at the same time, an operator A on H if we regard it as an observable of 
the whole system S. The values we obtained as results of the measurement are not, 
at least in principle, influenced by the presence of the other system. It is natural 
therefore to assume that A = A, in which case o(A) = 0(A;) by Problem 5.38b. 
Let us stress that, in contrast, the probability of finding the value of A is influenced 
because it depends on the state of the composite system; we shall discuss this in 
more detail in the next section. 


11.1.2 Example: Consider the system of two (different) spinless particles with the 
state space L?(IR®) = L?(IR%) @ L?(IR®) and denote by Q, P” the operators 
of Cartesian components of the position and momentum of the j-th particle, re- 
spectively. Following the above assumption, we have to define Q; := () ®I for 
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o="1) 2) seed: QO; =o 12Q?, for | = 4,5,6, and the operators P,; in an analogous 
way. It is clear from Example 7.2.3 that Q; = Q@I@-:-@1, etc.; this means 
that the choice of the position and momentum operators for a system of particles is 
consistent. 


Observables of the mentioned type do not, of course, exhaust the family O 
which also contains A,+ A, for A; € O;, other operator polynomials of these 
operators, etc. They nevertheless play an important role. 


11.1.3 Proposition: (a) The algebras of observables of the composite system and 
its subsystems are related by AD A, @ Ap. 


(b) Suppose that the state spaces H; are separable. If S; C O,; are complete sets 
of compatible observables for the subsystems S;, then S := { A; : A; € S;} 
is a CSCO for the system S. 


Proof: Since the algebras of observables in question are generated by Q,;UQs. C O,, 
assertion (a) follows from Theorem 6.6.5; in a similar way, part (b) is implied by 
Corollary 6.6.6. 


Assertion (b) illustrates, in particular, how the state Hilbert space is associated 
_ with a given quantum system. If we have a set of compatible observables A;,...,An 
and we believe that it is complete, we choose a space H and self-adjoint operators 
A,...,An with the needed spectra, algebraic relations, etc., so that they form a 
CSCO on #; naturally we do not start from zero because the founding fathers 
of quantum mechanics have spared us most of this effort. It may happen that 
in the course of time a new observable B is found, which is not a function of 
Aj,.--,An but is still compatible with them. We are then forced to find self- 
ana operators A,,...,A,,B which form a CSCO on a Hilbert space H ; at 
the same time we have a demand that the properties of A,...,A, are preserved, 
in particular, 0(A;) = o(A;) for j ="lynwyn. This problem is usually solved 
by setting H := H @®H, for some Hilbert space H, and A; := Aj;@I,. The 
observable B is then represented by B := 1@B, where the opts B oneH, 
is chosen in such a way that it has a simple spectrum; _then Theorem 5.8.6 in 
combination with Proposition 3b ensures that Aj,...,An,B isa CSCO. A classical 
example is the addition of the (third component of) spin to the particle coordinate or 
momenta proposed by H. Uhlenbeck and S. Goudsmith to explain the Zeeman-effect 
controversy about the number of atomic levels in a magnetic field. 

We want to know, of course, under which circumstances the inclusion of Propo- 
sition 3a turns into identity. In some cases, the following simple coherence result 
provides the answer. 


11.1.4 Proposition: Suppose that S; C O;, 7 = 1,2, are irreducible sets; then 
the set S := {A,: A; € S;} is irreducible on H; @ He and the system S is 
coherent. 
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Proof: The first assertion follows from Theorem 6.7.3; the set O is then also irre- 
ducible, so coherence is implied by Theorem 7.4.4b. 


11.1.5 Examples: (a) Since a spinless particle is a coherent system, by Proposi- 
tion 4 the same is true for any system of (mutually different) spinless particles 
— cf. Example 7.4.5b. The result remains to be valid even if some of the 
particles are identical, as we shall see in Section 11.4. 


(b) By Problem 7.15, the spin component operators S;, j7 = 1,2,3, of Exam- 
ple 7.5.4b form an irreducible set on €?+!; hence {Q;, Py: § = 1,2,8} 
is irreducible in the state space L?(JR*) @ €?*+', so a particle with a nonzero 
spin is coherent as well. Combining this with the previous example, we find 
that a system of particles with arbitrary spins is coherent. 


In all these examples we have A; = B(H;), and therefore A, @ Az = B(H; @ He). 
This ceases to be true if we add, for example, the isospin; if both subsystems 5S), S» 
are non-coherent the identity in Proposition 3a may not hold. 


11.1.6 Example: Consider the system consisting of a nucleon and a pion; for 
simplicity we neglect the nonisospin degrees of freedom (Problem 1). The nucleon 
state space is then C? and any observable is of the form Ay = ar ye where 
EW) a= | — EW) := $(03+ 1) ; similarly the pion space is €* and its observables 
ate Apa= Dp ee” , where E are again the projections to the eigenspaces 
of the charge operator Q = S}__, kB”) . The state space of the composite system 
is therefore #1 ~ C® and it is easy to check that 


1 1 
An ® Ar = {32 E Aubin dee) 


j=0 k=-1 


where«E;, := ES @ EB”) . However, the only superselection rule for the composite 

system is given by the total charge operator Qnx ‘= Q,+Q_. Its spectral de- 

composition is Qnvx = 23__,qEq, where Eq := Dj4k=q Ejx ; this means that its 

eigenspaces H, := E,H are two-dimensional for g = 0,1 and one-dimensional for 
= —1,2, and 


2 
Anz © @ B(H,). 
q=-1 

If the two algebras were identical, any observable of the nucleon—pion system would 
be reduced by all the projections Ej, , and this would in turn mean that any state 
of the system would be stationary. This contradicts the experimentally established 
existence of the transitions pr° + n+ and pr_ 4 nr; hence Ay, must contain 
observables which do not belong to A, @ A,. 

This conclusion clearly requires that both the involved systems are noncoherent. 
If we replace, for instance, the pion by an 7 meson (regarded for this purpose as a 
stable spinless particle), which is a charge singlet, we get Ann = An ® Ay. 
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Thus in general a more detailed analysis of the algebras A; is needed to decide 
whether they span the algebra of observables of the composite system. When dealing 
with quantum mechanical systems, we usually avoid this problem by assuming that 


® 
A = Y0°B(Ha), ree 
aed 
where the the coherent subspaces H, now correspond to the superselection rules of 
the composite system. 


11.2 Reduced states 


Postulate (Q6a) in combination with possible superselection rules tells us what the 
families of states of the composite system and its subsystems look like. Now we 
want to consider a more complicated question, namely in which states W;, W2 are 
the subsystems if the composite system is in a state W, and on the other hand, 
what can be concluded about the state of S from the knowledge of the states of S, 
and S». For simplicity, we shall assume throughout this section that we are dealing 
with the quantum mechanical case, i.e., that the algebra of observables of each of 
the subsystems is of the form (11.1). 

Suppose therefore that system S is in a state W, and let us denote the sub- 
system states as W;, 7 = 1,2. It is natural to require (A;)w = (A;)w, for any 
observable related to one of the subsystems only, where we again use the nota- 
tion A; := A; @ Iz, etc. It follows then from Theorem 7.3.3b that Tr(A;W) = 
Tr (A;W;), 7 = 1,2, must hold for all A; € A; (in fact, we should index the traces 
to indicate the space to which they refer, but we refrain from doing so as long as 
there is no danger of misunderstanding). This requirement is sufficient to determine 
uniquely the states of the subsystems. 


11.2.1 Proposition: Let W be the statistical operator describing a state of S; 
then there is a unique pair of statistical operators W,;(W) corresponding to realiz- 
able states and such that 


holds for all A; € A;. 

Proof: Consider, for example, the subsystem S, and define the functional fw : 
fw(Ai) = ee ; it is easy to see that fw is linear, positive, and satisfies 
fw(h) = 1, ie., it is a state on A;. We shall check that fw is normal using the 
eee —~ mentioned in Section 6.4. Tae a sequence of positive operators 
Ne 2 © Ajusuch that Ap = w-limy.00 mie A” exists. The operator A, is then 
again positive and equals w-limnpoo Dra1A (*) (Problem 3). Due to Theorem 6.4.1, 
the operator W determines a normal state on B(H,) D Ai, so 


fw(A:) = Tr(A,W) = yT(arw) a Av ( Al”), 
—iil k= 
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the state fw is therefore normal, and since the algebra .A; is of the form (11.1), 
it follows from Theorem 6.4.7 that there is just one statistical operator W,(W) , 
which is reduced by all H, and satisfies the condition fw(A:) = Tr(Ai\W,(W)) 
forall A,E A;. Ff 


The states W;(W) are called reduced (sometimes also component) states of 
the given W. We shall see in a while that we can not only prove their existence, 
but also find an explicit expression for them. First, however, let us ask the opposite 
question, namely to what extent is the state of S determined by the states of its 
subsystems. Given statistical operators W; on H; we easily find W such that the 
identity (11.2) is valid; in view of Problem 4 it is sufficient to choose W := W, @W4. 
However, the maps W,(-) are in general not injective. 


11.2.2 Example: Consider a pair of spin 5 particles, say, two electrons or a proton 
and an electron. For simplicity we again neglect the nonspin degrees of freedom and 
put H,; := (? ; the eigenvectors of the third component of spin of the j-th particle 
will be denoted as o? . A standard orthonormal basis in H = H, ® He consists of 
the vectors 


Pio (= se? @d? +9 ent”), dia = 2 @ 
i 
to = (9 242 — 6 @ dt), 


which are eigenvectors of the total spin of the pair and its third component; the first 
three of them span the triplet subspace H\) := EH, the remaining one spans the 
singlet subspace H\ := EyH (see Problem 16). The density matrices W; := 2; 
describe the unpolarized states of the two particles (cf. Example 7.3.2). We have 
Tr(A;W;) = $Tr A; for any A; € B(€?) ; on the other hand a straightforward 
computation using the above basis shows that Tr (A;E,) = 3 Tr (A;Eo) = 3 Tr A;. 
In this case therefore a statistical operator W = W(wo, w:) := woo + w; EF, , which 
satisfies (11.2), corresponds to any non-negative wo, w; such that 3w;+wo = 1, in 


particular, W (1,1) = W@W. 


The reason for this nonuniqueness is clear: the relation (11.2) defines the state of 
the composite system on a subset A, UA, C A which is too small to determine the 
algebra A completely. In some cases, however, the uniqueness can be guaranteed. 


11.2.3 Theorem: (a) If the subsystem 5S, is coherent and the state W2 is pure, 
then the relation (11.2) is statisfied by W = W, @ W, only. 


(b) If the states of both subsystems are pure, W; = Ey, , then only the pure state 
W = Ey, where wp := % @ ye, satisfies (11.2). 


Proof: We choose orthonormal bases {¢;}, {xx} in the spaces H; consisting of 
eigenvectors of the operators W; and W2, respectively; the corresponding sequences 
of one-dimensional projections are denoted by {FE} and {F}. If W2 is pure, 
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we may assume without loss we age: that W, = F . Substituting A, := FP“ 
in (11.2), we get de: = Tr (EW) = 0; Wie where Win,et = ($:@Xn, W (be ®@x1)) 
are the matrix elements of the cet, W . Its positivity implies Wiz. = 0 unless 
k =1. Furthermore, by choosing A, := E , the last result implies 


Wir = > Wie = Te (EOW,) = wy, 
k 


where wi? is the corresponding eigenvalue of the operator W,. Let {tm} be an 
orthonormal basis in composed of eigenvectors of the operator W, Wum = 
Wmlm. We denote af”) = = (Wm, oi ® Xe) ; using the fact that W is Houeded we 
easily find W(¢¢ @ x1) = Xn Oy Wnt , and therefore Win,kt = En ea” Wyn - 

Recall that Wixix = 0 for k # 1 ; the eigenvalues w,, are non-negative, so if 


Wm > 0 and n#1 we get af” = 0, i.€., 
Wan=O0, n#1 or Il. 


If W, is pure we may at the same time choose the basis {¢;} in such a way that 
W, = E“. Repeating the argument with the roles of Wi, W2 interchanged we 
get Wings = O unless i = k = 1. Together we have Wins = 6:16n16n16n = 
(W, @ Wo)inet for all i,n,k,l, so the operators coincide, W = W, @W.. The 
states W; are pure by assumption; hence W? = W? @ W? = W, @ W2 = W and 
W is ne pure. This proves assertion (b). 

Up to now we have not employed the coherence assumption. If S; is coherent, 
then Tr(B,W,) = Tr(B,W) holds for any B, € 7; ; using the relation Wij = 
wo) proven above, we get 

S(Bi) eWay = 0. 

i¢k 
We have Six |Weiiil? = Diner |Weal? = ||WIIZ < oo, which means that the 
matrix {Whi : i,k = 1,...,dimH,} represents some operator C, € B(). 
Substituting B, := C, in the above relation, we obtain Wii, = 0 for i Ak. In 
combination with ne two displayed relations from the first part of the proof, this 
implies Wijn = w; ” 6 ieb5161 = (W, © W2)in for all values of the indices, 1.e., 


W=W,8W.. & 


Let now ask now what the reduced states look like. Without loss of generality 
we may assume that the composite system is in a pure state W = Ey, (Problem 5b). 
We choose orthonormal bases € = {¢;} and F = {xx}, arbitrary for the moment, 
in the spaces H,, H2, respectively, and express the vector ~ representing the state 
W in the tensor—-product basis, = Yii.4 Qin0i ® Xx. By assumption, 7 is a unit 
vector, so }ix{QiK%|? = 1. The left side of (11.2) can be expressed formally as 
Wine Guna(Gi, Ai’e)1. We shall check that the series is absolutely convergent, so 
we can rearrange it obtaining 


mm A,W) = ddl Aidx)i, 
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where bg; := 5; Gijxj. The Hélder inequality together with the normalization 
condition implies 5,5 |bi%|? < 1, and if A; is a Hilbert-Schmidt operator then 
another application of the Hélder inequality yields absolute convergence. The matrix 
(b;;) represents in the basis € an operator of J2(H;) which we denote as W, It 
is easy to check that W“) > 0, and moreover, TrW") = D, bi = 1 so W™ is a 
statistical operator. Finally, 


Tr(AiW™) = So bei(Gi, Arde): = Tr (Ai1Wi(W)) 
ik 


for all A; € A, M.Jo(H,). In the same way we can construct the statistical operator 
W) € B(He2) , which is represented in the basis F by the matrix with the elements 
Ci := D0; Gij0y and fulfils the condition Tr(A,W®@) = Tr(A2W2(W)) for any 
observable Az € A2M Je(H2). 

If the two subsystems are coherent, we can set A; := W)— W,(W) in these 
conditions; it follows that W;(W) = W”. This conclusion may not be valid if 
at least one subsystem is incoherent, since it is then not ensured that W € A;. 
However, we can use the projections E) referring to the coherent subspaces in H; 
to construct the statistical operators 


W; := >> EYW® EO (strong convergence) (11.3) 
ael 


such that Tr(A;W;) = Tr(A;W)) holds for all A; € A; (Problem 6). Now the 
operators W;— W;(W) already belong to A; Je(H;), so the same argument as 
above implies W;(W) =W”), 

The relation (11.3) together with the expression of the operators W’) by means 
of the Fourier coefficients of the vector ~ , 


Wg, 


nj nq 
> binde, bie = D> Grane , 
k=1 t=] 
(11.4) 


n2 m1 
Wy, = > exes Cx = > axon , 
k= =1 


where n; := dimH;, fully answers the question of what the reduced states corre- 
sponding to a given pure state ~ look like. In the particular case when the sub- 
systems are coherent, the reduced states are given directly by the relations (11.4), 
which are called reduction formulas. 

Next we want to show that if the orthonormal bases are suitably chosen the 
relations (11.4) can be cast into a certain standard form. Suppose therefore that 
& = €, where € := {;} consists of the eigenvectors of W'!) ; the corresponding 
eigenvalues will be denoted as wi)? , The first formula of (11.4) then yields 6, = 
ws? dix independently of the choice of the basis F . If w\) > 0, then at least one of 
the coefficients aj, k = 1,...,n2, is nonzero, so we can construct the unit vector 
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= (w!”)-!/2 5, aux: , which satisfies the identity 


(xe WX i)o = (wh?)-/? S owe = (wh?)-¥? 5 at D> GintOnk - 
t 1 n 


Using the Holder inequality together with the relations (11.4) and wi) =1,we 
easily check that the series on the right side is absolutely convergent, Son) |@inQinQik| < 


(w” cri) 2 Interchanging the order of summation, we get 
(xe, WX)o = (wh)? Scud: = one (wh)¥?, 
1 


and therefore W 7; = w{¥;. This means that any nonzero eigenvalue of W is 


at the same time an eigenvalue of the operator W) . Repeating the argument with 
the roles of the operators interchanged, we find o,(W) \ {0} = o,(W)) \ {0} ; 
these common — will be denoted as w;. The sequence {Xi}7.,, where 
n := dim RanW = dim Ran W) , can be completed to an orthonormal basis F 
in Ha. Then € x F is a basis in WH and the Fourier coefficients of the vector w 
with respect to it are 


(pe Xi, W = Vayu ue Lo Atma (Ge @ Xs $i X5) wey pa 2, Amt = Vwi Or - 


i,j 


Hence we have 


y= Smee 


i=l 


(11.5) 
W) = yb, W = Sur, 
i—1 


ea) 


where B®, Fe are the one-dimensional projections corresponding to the vectors 
of the bases € and F, respectively. Let us summarize the results. 


11.2.4 Theorem: If the composite system is in a pure state W = E,, then the 
reduced states W;(W) are given by the relations (11.3) and (11.4). In particular, 
if the subsystems are coherent, then there are orthonormal bases € and ¥ which 
enable us to express the reduced states in the normal form (11.5). 


11.2.5 Exppple: ,Conaides again the two-particle system of _ temple 2. The states 
Ey 41 := EW) @ EY reduce by Problem 5a to W;(E1,41) = EY). On the other hand, 
the projection Eo to the singlet subspace represents an sole of a pure state, 
which cannot be expressed in the form ae ® ae The Fourier coefficients of the 
vector ¢oo with respect to the basis { gf? @ g 4 ,j = +} are easily found to be 
O44 =a. =0 and a4_ = —a_4 = 27”, so the reduction formulas (11.4) give 


i : 
W3(Eo) = 555; J =wg2.. 
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Putting further ds := o¢ ) and Xs = +¢2 , we are able to write ¢oo in the normal 
form (11.5), 


goo = Fz br Oi + 4-08). 


The pure state described by the vector ¢1o leads again to the reduced states 
W;(Ew) = 41;. The same is true for the mixed triplet state 3 ,, where Prob- 
lem 5b gives W,(4 Ei) = 31; + LED + BE) = 3 1;, and also for any mixture of 
the states 4 £, and Fp. 


11.3. Time evolution 


Let H and H; be the Hamiltonians of the system S and its subsystems, respec- 
tively; in a similar way we denote the corresponding propagators. The operator H 
is often of the form 


where Hinz is the interaction Hamiltonian of the subsystems. The splitting of the 
energy observable into this part and the free part, which is the sum of the subsystem 
Hamiltonians, usually has a direct physical interpretation; it makes sense, of course, 
if the operator H, + Hy + Hint is e.s.a. When the interaction Hamiltonian is 
Hermitean, this property follows easily from Theorem 5.7.2, while in the general 
case it is sometimes highly nontrivial to check it. 

The subsystems 5), Sp are said to be noninteracting if Hing = 0. Since the 
generator is associated uniquely with a continuous unitary group, Proposition 5.9.6 
yields the following simple criterion. 


11.3.1 Proposition: The subsystems 5S, Sz are noninteracting iff their propaga- 
tors are related to the propagator of the composite system by U(t) = U,(t) @ U2(t) 
for all te IR. 


11.3.2 Remark: We can speak about noninteracting subsystems even if the sub- 
systems S; are nonconservative, so the corresponding H, are replaced by H;(t) in 
(11.6), and Hinz = 0. The tensor-product decomposition can in this case be proven 
again, provided we are able to check the existence of the corresponding unitary 
propagators — see, e.g., Problem 8. 


If the subsystems interact, the relation between their time evolution and that of 
ti.2 composite system is more complicated. We know from the preceding section that 
for any state W; ofa com pails quantum mechanical system there is a unique pair 
of the reduced states W,?) := W,(W,) of the subsystems. Their time dependence 
can exhibit rather pathological properties: 


(i) the quantity Tr (we ))2) may not be preserved; in particular, a pure state may 
evolve into a mixed one and vice versa, 
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(ii) the operators U;(t,s) connecting the states of the subsystems at different 
times, W = U;(t, s)W\D)U;(t, s)~!, may not exist, and if they do, they may 
not have the properties required for a propagator. 


11.3.3 Example: Consider again the system of two spin 3 particles of Examples 
11.2.2 and 11.2.5 and set 


where o; are the Pauli matrices and 4 € R. If other than spin degrees of freedom 
are neglected, this operator represents a typical nonrelativistic interaction of the 
particles. We shall assume that the state is described at the initial time instant 
t=0 by oy? ® go) . The eigenvalues of the operator 0 @o are 1 and —3; using 
Example 5.2.3, we find 


U(t) = ePHe@e)2 — | co5(At) — SI +080) sin(A2) | 
(cf. Problem 9), so 
w(t) = (UA) (@P @4®)) = [@P @ 4) cos(At) — i(6© @ oP?) sin(At)] e™*/?. 
The reduced states W;(t) := W;(Eyety) are then 
W(t) = E,cos*(At) + E_sin?(At), We(t) = E,sin?(At) + E_ cos?(At), 


where Ex = 4(I+03). Since Tr (W;(t)?) = $(3+cos(4At)) we see that the reduced 
states are mixed unless t = a for an integer k ; the operators U;(t,s) exist only 
if the involved time instants satisfy the condition t+s = = : 


From the practical point of view, however, these difficulties are not very impor- 
tant because in most cases we are not interested in the time evolution of reduced 
states (see the notes). On the other hand, they have an implication for the time— 
evolution postulates of the previous chapter. We have tacitly assumed that the sys- 
tem is either isolated or it interacts with the environment, which is not influenced 
by its presence. The example shows that the assumption is substantial; without it 
the mere existence of the evolution operator is not guaranteed. 


11.4 Identical particles 


We have stressed in the introduction to this chapter that the postulate (Q6a) applies 
only to cases when the considered systems are mutually different; now we are going 
to discuss the situation when the composite system contains two or more identical 
subsystems. We first have to say a few words about the meaning of identity of 
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microscopic systems. For simplicity we shall speak about elementary particles; how- 
ever, the conclusions also extend to systems composed of nuclei, atoms and other 
“nonelementary” constituents. 

Properties of a particle can be divided into two groups. Some of them, for in- 
stance, its spatial localization or a spin projection, determine its state. The other 
group includes internal characteristics like the mass, spin, electric charge, etc., which 
are used to classify the particles. The number of different elementary--particle sorts 
is known to be finite, and even not too large; just six of them (together with their 
antiparticles) are stable, among them only the electron and the proton (whose sta- 
bility has also been challenged) can be claimed with certainty to have a nonzero 
rest mass. If we add the particles with lifetimes of 2 107'®s the number will grow 
to about thirty varieties. The principal assumption of quantum theory is that two 
particles of the same sort cannot be distinguished. 

In classical mechanics identical objects, e.g. point particles of the same mass, 
can be distinguished by means of their states: we can ascribe a trajectory to each 
of them, so it is sufficient to tag them somehow at the initial instant. In quantum 
theory, however, this idea does not apply because localization is not preserved in 
general (cf. Problem 9.8); once the wave functions of two particles overlap there 
is no way to determine which one of them has been registered in the intersection 
of their supports. Hence we have to adopt the mentioned indistinguishability 
principle of identical particles with all its consequences; we have to exclude any 
observable which would make it possible to tell one particle from the other. 

Consider first a pair of particles each of which has the state space H. We are 
going to show that it is sufficient now to consider a certain subspace in H‘ := 
H®H. We set 


Up (sow 0 4) = Yo be O Ue (11.7) 
k=1 k=l 


for any dx, Ye € H and a positive integer n. This defines an operator on HxH ; 
its continuous extension to H'?) , denoted again as Up, is unitary (Problem 10a). 
Consider now states W and Wp := UpWU5' of the composite system. If they 
are pure and described by vectors y and wp := Upw, then the reduction formulas 
(11.4) yield W;(W) = W3_,(Wp) for 7 = 1,2 ; by Problem 5b this conclusion 
extends to any admissible state W. The roles of the particles in the states W 
and Wp are therefore switched, and the principle of indistinguishability is nothing 
else than the invariance of the system with respect to particle interchange. By 
Proposition 10.1.4, any observable A of the pair has then to commute with the 
operator which represents the transformation, 


UpAC AUp. 


Define further the operators Sj := 3([+Up) and Ay := 4(I—Up) ; since UZ = 1, 
they are mutually orthogonal projections such that S»+ A> = 1. The above relation 
then says that any observable of a pair of identical particles is reduced by the 
projections S2 and A». It appears as if the identity would require introducing a 
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new superselection rule. However, we know from experience that admissible states 
always belong to only one of the subspaces, i.e., that any state of two identical 
particles satisfies just one of the conditions 


Upp = +. | (11.8) 


These transformation properties are referred to as Bose-Einstein statistics in the 
case of the upper sign when w € S2H®) , and Fermi-—Dirac statistics for the lower 
sign with 7 € AoH®). 

It is known that the behavior of the particles is governed by Bose-Einstein 
statistics if the particles involved have an integer spin, and by Fermi—Dirac statistics 
in the case of a half-integer spin; for brevity these categories of particles are called 
the bosons and fermions, respectively. In the framework of relativistic quantum 
field theory, this empirical fact can be explained (see the notes). 

These conclusions extend to the case of a system consisting of n identical par- 
ticles. To describe their interchanges, we introduce the group S, of permutations 
of n elements, which we write as 


ie. 2 mee 1) 

sa Mi Py ees Dade: 
the symbol €, denotes the parity of the permutation, ¢ = +1. Since the particles 
are supposed to be identical, the system is invariant with respect to S, , so by the 


postulate (Q5b) its state space has to support a representation of this group. To 
construct it, we set 


U(p)(¥1 @++:@n) = Up, +++ @ Up, - (11.9) 


for any p € S, and arbitrary ~; € H ; in the same way as above we can check 
that the map U(p) is well defined and extends continuously to a unitary operator on 
H”) := H®---@H denoted by the same symbol. The corresponding transformation 
leads to the appropriate permutation of the reduced states (Problem 11). Moreover, 


U(p)U(p) = U(pp) 


holds for any p,p € S,; in particular, U(e) = I for the identical permutation 
e € S, and U(p~!) = U(p)*. This means that U : S, — B(H™) is the sought 
representation; if n = 2 the group consists of two elements represented by the 
operators J and Up. Next we define the operators 


= 4 UG), A= + Devo), 


We pESn * pESn 


which are easily seen to be Hermitean; the group property of U(p) yields the re- 
lations U(p)S, = S, and U(p)An = €pAn, which in turn imply 52 = S, and 
A2 = Ay ; this means that S, and A, are projections. 
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The indistinguishability principle requires the n-particle system to be invariant 
with respect to the group S,,so U(p)A C AU(p) must hold for any observable A ; 
it follows that 

S, Ac ASs, Bipei@ Ata. 


In this case the realizable states again correspond only to vectors of one of the 
subspaces S,H) and A,H'” depending on the spin of the particles (see also the 
notes); hence the postulate (Q6a) on the state space can reformulated as follows: 


(Q6b) the state space of n identical particles with an integer (half-integer) spin is 
the subspace S,H™) (respectively, AnH” ) in H® := H@---@H, where 
H is the state space of one particle. 


Recall the physically most important consequence of this postulate. 


10.4.1 Example (Pauli principle): Consider a system of n identical fermions in a 
state = @-:-@Un € AnH™. Suppose that ~; = vx holds for a pair of different 
indices j,k, and let pj, denote the transposition of the j-th and k-th elements. 
Its parity is negative so U(pj,)An = —An. We have A,w =~ by assumption, and 
at the same time, U(p;,)y = y%. Combining these relations, we get y = 0 ; this 
means that in a system of identical fermions no two can be simultaneously in the 
same pure state. 


11.5 Separation of variables. Symmetries 


The more degrees of freedom a quantum system has, the more difficult it usually 
is to treat it. We have seen, however, that the task simplifies considerably if the 
system can be decomposed into noninteracting subsystems, either real or fictitious. 
It happens often that the Hamiltonian H itself is not of the form H, +H, but we 
can pass to a unitary equivalent operator U-!'HU which already has this property; 
most frequently U is some “substitution” operator of Example 3.3.2. In this section 
we are going to illustrate the method by discussing two simple situations of this type. 
The first of these concerns the separation of the center-of-mass motion. Con- 
sider a system of two spinless particles with the state space H := L?(R®). The 
configuration-space vectors which appear as the wave function arguments will be 
denoted as (21,22), where x; = (xj1, 252,233) refers to the position of the j-th par- 
ticle, and Q;, are the corresponding coordinate operators; the same double-index 
notation will be used for the momenta. Suppose that the potential V: IR®° — R 
by means of which the particles interact depends only on their distance, i.e., it can 

be expressed as 
V(x1,22) = v(2z2— 2) (11.10) 


for some Borel function v: R* — IR. The Hamiltonian of the two-particle system 
is then the closure of 


= 
2m, 


1 
P+ oP +V|y, 


Hs Hy = | 2m 
2 
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provided the operator in the square brackets is e€.s.a., where ae := Ty with the 
domain Dy := {y € L7(R*): Vo € L°UR®)}, P? := Di, P,, and m, is the 
mass of the j-th particle. We introduce the conreroismass and relative position 
vectors 

X := (101+ fete, F i= 2-71, 
where p; := m;/M and M := m+ mz is the total mass of the system, and use 
them to define the substitution operator U on L?(JR®) by 


(Uy) (a1, 22) = ¥(X,2). (11.11) 


It follows from Example 3.3.2 that U is unitary. Its inverse is (U-')(X,xr) = 
W(X —p2x, X+p12) ; it is easy to see that both U and U—! preserve S(JR*®). Next 
we consider the operator of multiplication by the function 


Wie) uae 


A simple substitution argument gives Dw := U-!Dy and Wy = U~!VUy for 
any ~ € Dy, i.e., the operator identity W = U-!VU. As for the momentum 
coordinates, it follows from the relation (7.5) that 


U"PyUd = —iU~"|piU (O16) — U(Oxd)) = wi Pied — Pord , 
U"PyUd = —iU~"[poU (Oud) + U(Oxd)) = p2Pird + Pod 


for any ¢ € S(JR’) , so an easy computation yields 


i 
Uo | + 5 2| Ud = = pe ae Fie 


(Problem 12a), where ps := mmM2/M is the reduced mass of the particles. Together 
we get the identity 


1 


UHUYy = p? + ree +Wlo 


E 
for any w € S(IR®)M Dw. Consider now a dense subspace L?(JR*) such that 
Dc S(IR?)N D,. The two-particle state space decomposes as L?(JR®) = L?(JR°) @ 
L? UR) ; using the definition of the function W we find SUIR*)x D c S(IR°)N Dy , 
and therefore 

UHU} (SUR®)£D) = Tom @I+1® Hyet , (11.12) 
where T.., and H,.; are operators with the domains S(JR*) and D, respectively, 
defined by 


i! i! 
= —— H, = —— ; 
(Oe Ad, rel Du Ay + vy 
The operator Tam is e.s.a. by Problem 7.16; hence if we are able to choose D in 


such a way that H,.; is also e.s.a., the same will be true for Tom @I+1® Hyer due 
to Theorem 5.7.2; this in turn implies that S(IR*)xD is a core for H. 
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Instead of checking the essential self-adjointness of operator H , we thus have 
to solve the simpler problem of finding a suitable subspace D Cc S(JR*) MD, such 
thate Aye, = (— 7, 4 + v)})D is e.s.a. Furthermore, it follows from (11.12) and 
Problem 4.27d that 


Uae = Tom © 1 +1 Are = Tom ®@14+1® Bye, 


so other properties of the two-particle Hamiltonian can also be derived from those 
of the operator Hre: ‘cf. Problem 12). In particular, due to Proposition 11.3.1 and 
Problem 5.12 the corresponding unitary propagator is expressed as 


U-leHtyy = exp(—iT mnt) @ exp(—iHreit) ; 


in addition, the center-of-mass part is known explicitly because Tan is the Hamil- 
tonian of a free particle of mass M ,so exp(—iT ant) is given by Theorem 9.3.1 with 
n=3 andm;=M. 

Concluding this discussion, we see that if the two-particle potential is of the 
form (11.10), the original problem can be reduced to analysis of the Hamiltonian 
H,.1 that describes a single particle with the reduced mass p. 


11.5.1 Remark: In a similar way, we can separate the center-of-mass motion 
in systems of N particles interacting via potentials, which depend solely on the 
differences of the particle position vectors (Problem 13). The new feature here is 
that while in the two-particle case (11.11) represents the only natural choice of the 
corresponding operator, for N > 2 there are different possibilities related to different 
relative coordinates. The two most frequently used are the atomic coordinates 


Ys=2j-tN, jHl,...,N-1, (11.13) 


where the positions are related to a chosen particle, and the Jacobi coordinates 


F : -1 
J J 
25 = Lj41 — SY mix; (sm) ’ j= lywe--,N—l. (11.14) 
i=l t=] 


The former are suitable in situations where some particle is privileged, for example, 
being much heavier than the rest. On the other hand, z,; in (11.14) describes the 
relative position of the (j7+1)-th particle with respect to the center of mass of the 
preceding j particles, and therefore the ordering is essential in this case. 


The second example we are going to discuss concerns particle motion in a cen- 
trally symmetric potential, i.e., we consider the operator (9.2) on L?(JR*) with 
m, = M2 = m3 =: m and the function V such that 


V(x) = v(r),  ri= Vz} +23 +23, (11.15) 


for some Borel v: R+ > R. Let S = S? be the unit sphere in IR?. The measure 
Q on S is defined using the spherical coordinates, 


Z1=rsinOcosy, z2=rsin@siny, 23 = rcos6, 
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by dQ(6,~) = sin@ dé dy ; it is invariant with respect to rotations of the sphere, 
because an arbitrary rotation can be expressed as a sequence of rotations around 
the axes (Problem 14a). Moreover, we can introduce the operator U : L?(IR%) > 
L?URt x S, drdQ) by 
(Up)(r, 0,9) = rp(a). (11.16) 

It is easy to check that U is an isometry of the two spaces; recall that the latter 
can be identified by Example 2.4.5 with L?(IR*) @ L7(S,dQ). 

Before we use the operator U to transform the Hamiltonian, we need to know 
something about the space L?(S,dQ). It contains, in particular, the spherical func- 
tions Yi, defined as 


al+1 (1—|ml)!]/? im i 
a eae 


forany 1 =0,1,... and m= —l,—/+1,...,1, where P” are the so-called associated 
Legendre functions on the interval [—1,1] which are given for n = 0,1,... and 
l=n,n+l,... by 


Yim(9,) == (-1)™ | 


FP) : = (uz apne s dz Ren a ie 
We define the operator A on L*(S,dQ) by 
aS) Og 1 69 


then the following result is valid. 


11.5.2 Proposition: Yj), are eigenfunctions of A: AYim = —l(l+1)¥Yim, and 
{Yim : 1=0,1,..., m= -—l,...,1} is an orthonormal basis in L?(S,dQ). 

Proof: The first assertion follows from the fact that the functions P” satisfy the 
equation (1—z7) f” — 2zf’ + [l(l+1) —n?/(1—z?)| f = 0. To check that the spherical 
functions form an orthonormal basis in L?(S,dQ), we express the latter in the 
tensor-product form L?(0,27) @ L*((0,7),sin@d6) and notice that the last named 
space is isomorphic to L?(—1,1). In view of Example 2.2.2 and Problem 2.24, it is 
sufficient to show that the set 


a 1/2 
{ (24 =) Ft tennt.| 


is for any n = 0,1,... an orthonormal basis in L*(—1,1) ; this is true, however, 
because the functions in question are obtained by orthonormalization of the family 
{ze (1—2?)"/?z7 : 7 =0,1,...} which is total due to Example 2.2.2. I 


Now we are able to use the operator (11.16) to transform the Hamiltonian. 
Choose D := S(JR*)M Dy as the domain of H ; then a straightforward computation 
yields 


UHU"6 = ~ 5 (@ += =A) ® : vb (11.18) 
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for any ® € UD (cf. Problem 14c). By Proposition 2, L?(S,dQ) = @j2o% , where 
G, is the subspace spanned by Yim, m= -—l,...,1; it follows that 


UL2UR®) = L2(Rt, dr) @ L°($,d2) = @ LR) @Gi. 
l=0 


Let D, be the domain of the operator of multiplication by the function v, ze., 
D, := {9g € L?(Rt) : vg € L?(IR*+)} and suppose for a moment that for any 
1 =0,1,... we can find a subspace D® c Cg°(IR* \ {0}) MD, in such way that 
the operator h; defined on D) by the differential expression 


ae E = oe) + v(r) (11.19) 


dr2 r2 


is e.s.a. Then the operator h;@ I; on L?(JR*+) @G, with the domain D,; := { fYim: 
f ¢D®,m=-l,...,l1}, where I; is the unit operator on G,, is again e.s.a. by 
Theorem 5.7.2. Since D; C US(R°) due to Problem 15b and D” Cc D,, it follows 
from Theorem A.3.11 that 


[Velde =f MUVd)(r,8,p)Pdr dO = |lvf¥iml? < 00 


holds for = U~!(f¥im) € SUR’), and therefore D, Cc U(S(IR°)N Dy) = UD. 
Now the relation (11.18) in combination with Proposition 2 yields the identity 


UHU"}D, =h@h. 


Next we construct the operator H as the algebraic direct sum of h; @ I; i.e., its 
domain D Cc UD consists of all finite sums © = \, & with ¢ € D,, and the 
operator acts as H® := > (hi @ h)d for any ® € D. It is easy to see that 
Hc Sm H (see the notes to Section 4.4), so the closures of the two 
operators coincide, and since h; @ fl; are e.s.a. the same is true, by Problem 4.29, 
for H, and a fortiori, for its symmetric extension Ba h, ® I;. Moreover, we have 
UHU-) D = H, and therefore 


UHU" = >°hnoh (11.20) 


t=0 


due to Problem 4.27d. In this way we have reduced analysis of the operator (9.2) 
with a centrally symmetric potential to treatment of the sequence of ordinary differ- 
ential operators (11.19); this procedure is usually referred to as the partial-wave 
decomposition. We have demonstrated that if all the operators h; are e.s.a. the 
same is true for H. In that case the spectrum of H is determined by the spectra 
of hy: using Problem 4.24b and a simple induction argument we find 


o(H) = U o(ht) 


i=0 
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and similar relations for the other types of spectra. 


11.5.3 Remark: The presented construction generalizes to operators (9.2) on 
1/2 

L?(UR"), n > 2, with a potential V which depends on r := be x) ” The gene- 

ralized spherical coordinates can be introduced as the map 2 : IR+x S"-! > RR”, 

where S”~! is the unit sphere in JR”, defined by the recursive relations 


a” := 2? sind,-9, k=1,...,n—-1 


x := 7 cos On_2, 
with 2) = pees and a? := r sing ; the parameter y runs through (0,27) 
and 6;, 7 = 1,...,n—2, through (0,7). The map Un: Unw = r-)/poa 
then defines a unitary operator from L?(JR") to L?(IR*+ x S"—1,drdQ,), where 
diy (es sin’ 0;) dypd6,...d@,-2. The relation (11.18) is now replaced by 


1 = 
Un (-5-A+V) al 
(11.21) 


i 


eee. main 3) 1 
~ 9m | Or? 4r2 


a 72 An g+ud, 


where A, is the so-called Laplace—Beltrami operator on S"-! (see the notes). We 
can again show that there are finite-dimensional subspaces i GEOG) 


L?(S"-1,dQ,) such that L?(S"-!,dQ,) = 2g!” and 
AnY = —l(l+n—2)Y 


holds for all Y € G{”) , so instead of — =, A+V we have to analyze the correspon- 
ding countable family of ordinary differential operators on L?(JR*). 


In both the examples discussed here, symmetry of the problem played an im- 
portant role. Decomposition into partial waves is based in fact on a reduction of the 
rotation—group representation generated by the angular momentum operators into 
irreducible components. On the other hand, if a two-particle potential is of the form 
(11.10), we are able to decompose the corresponding system into a pair of fictitious 
subsystems; among these the subsystem associated with the center-of-mass motion 
is free, and therefore invariant with respect to translations — cf. Example 10.2.1b. 

It frequently occurs that more than one subsystem of a composite system exhibits 
asymmetry. Often the subsystems have the same symmetry group. In order to look 
more closely at transformation properties of composite systems, consider again for 
simplicity the case of a system S consisting of a pair of subsystems 5, S2, and 
suppose that transformations of a group G can be applied to both of them, and that 
they are realized on the state spaces ; by means of representations U;(-), 7 = 1,2; 
the subsystems are invariant with respect to G iff their Hamiltonians H; commute 
with U;(g) for all g. In view of the postulate (Q5b), the transformation group G 
has to be represented on the state space H := H, @ Hz. We assume that 
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(Q6c) states of S transform with respect to G by means of the representation 
U: GH defined by U(g) := Ui(g) @ U2(g) for any g EG. 


It is evident that U(-) is a representation of G. In most situations, assumption 
(Q6c) can be justified easily. In particular, if G is a continuous group such that the 
representations of its one-parameter subgroups are generated by observables having 
an additive character, we can use Proposition 5.9.6; an illustration is given by the 
addition of angular momentum and spin in Example 10.2.4 or by Proposition 11.3.1. 

It is therefore reasonable to postulate this relation between transformation— 
group representations for all composite systems. Using it, we obtain the following 
assertion. 


11.5.4 Proposition: If the subsystems are invariant under G and do not interact, 
the composite system is also invariant with respect to G. 


Proof: By the spectral theorem, a system is invariant under G iff its propagator 
commutes with the corresponding representation of G ; the result then follows from 
Proposition 11.3.1 and Theorem 4.5.2a. 


We have seen how a reduction of a symmetry—group representation to irreducible 
components simplifies analysis of a system. It often happens, however, that the 
representation U(-) obtained as the tensor product of the subsystem representations 
is reducible, and we have to perform its reduction. A simple illustration is given in 
Problem 16; more information about these reductions for various groups — which 
are a frequently used tool in studying symmetries of quantum systems — can be 
found in the literature quoted in the notes. 


Notes to Chapter 11 


Section 11.1 A deeper justification of the postulate (Q6a) within the axiomatic approach, 
which will be discussed in Section 13.1, can be found in [ AD 1,2]. We cannot conclude 
from Example 6 that any self-adjoint operator reduced by the total charge eigenspaces is 
an observable of the nucleon-pion system, because the neglection of the nonisospin degrees 
of freedom is a rather crude simplification. If we should want to justify assumption (11.1) 
it is worth keeping in mind that in quantum mechanics, where we typically deal with finite 
systems of particles which preserve their identities, with a few simple superselection rules 
at most, algebras of observables represent a useful ingredient of the formalism but they 
are not indispensable. The charge-exchange processes mentioned in the example belong 
in fact to quantum field theory; we mentioned in Section 7.6 that assumption (11.1) is not 
valid there even in the coherent case. 


Section 11.2 The restriction to the quantum mechanical case with the algebra of ob- 
servables of the form (11.1) is natural, because in other cases statistical operators may not 
be an optimal tool for description of states. Notice also that nowhere in this section do 
we need to know the algebra of observables of the composite system. We repeatedly use 
countable orthonormal bases without assuming separability of the state spaces involved; 
this is made possible by the fact that the series in question always have only countably 
many nonzero terms — cf. Remark 6.4.2. 
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An attentive reader may wonder why we speak about a pair of electrons in Example 2 
when we declared above that the choice of the state space based on the postulate (Q6a) 
does not apply to systems of identical particles. The answer is that the considerations of 
Section 11.4 concern the whole state space; in this particular situation the two electrons 
can exist in the triplet as well as in the singlet spin state, but a different symmetry of the 
configuration-space part of the wave function corresponds to each of these cases. 

The relations (11.4) show that the reduction of a pure state leads in general to mixed 
states; in quantum mechanical textbooks these are usually referred to as mixed states of 
the second kind. It should be stressed, however, that it is irrelevant for the formalism 
whether a mixed state is obtained in this way or whether it expresses our incomplete 
knowledge of the state of a particular member of some family of identical systems. 

- Reduction formulas (11.4) can be written in an elegant form provided we use the 
particular realization of the tensor product mentioned in Problem 3.39b — see, e.g., [Jal], 
Sec.11-8. This expression, as well as the norma! form (11.5) of the reduction, applies only 
to the case of a system consisting of two subsystems, while the relations (11.4) generalize 
easily to any finite number of subsystems (Problem 7). 


Section 11.3 We have mentioned that we are not always interested in the time evolution 
of reduced states. Recall two typical situations where we encounter systems consisting of 
interacting subsystems 5S), Sp. A bound state of the subsystems is a pure state described 
by an eigenvector of the total Hamiltonian H. Such states are stationary and the same 
is, of course, true for reduced states. On the other hand, if we investigate the scattering of 
S2 on S;, we compare the full propapagator U(t) = e~*#* of the composite system with 
Uo(t) = e~*#1' @ e-*#2t corresponding to the case when the subsystems do not interact; 
loosely speaking, the time behavior of the reduced states is here of interest in the region 
where the reduction is trivial. More will be said about bound and scattering states in 
Section 15.1. 


Section 11.4 A more detailed discussion of the concept of identity in quantum theory 
can be found in [Ja], Sec.15-3. The connection between the type of a particle and the 
transformation properties (11.8) of its states is usually referred to briefly as the relation 
between spin and statistics. In quantum mechanics it has to be regarded as an empirical 
fact whose most striking manifestation is the Pauli principle discussed in Example 1; it 
has deep physical consequences which we shall mention in the notes to Sec.14.3. 

On the other hand, in quantum field theory the relation between spin and statistics 
denotes a result closely related to the mentioned transformation property, namely that field 
operators in points which are causally inaccessible, 7.e., separated by a space-like interval 
cannot commute (anticommute) if the particles associated with this field are fermions 
(respectively, bosons). In the framework of axiomatic quantum field theory this claim can 
be demonstrated — see the notes to Sec.13.3. 

The permutation group S, and its representations are discussed, e.g., in | Ham ], 
Chap.7; [BaR], Sec.7.5. If n > 2 we have S,+An # J ; this is related to the existence of 
other irreducible representations of 5S, than the two we have employed here. We naturally 
ask whether there are systems with a permutation behavior different from Fermi—Dirac 
and Bose-Einstein statistics. Various parastatistics of this type have been studied (an 
extensive discussion of the problem is given, for instance, in [OK ]) but they remain until 
now a mere mathematical construction. 


Section 11.5 The Laplace-Beltrami operator Ay is defined on a Riemannian manifold 
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M with a metric tensor g by (Amw)(x) := —9(x)7'/?(A,9(x)/29¥ (x)0;p)(x) , where 
8; := 8/AxI and g(x) := det(g(x)). Notice the sign convention: in the case M = JR” 
when gi; = 6:; we have Ay = —A. Properties of the operators Ap := Aga-1 for n = 2,3 
can be demostrated directly as we did in Proposition 2; for the general case we refer, e.9., 
to the monograph [ Miil]. Examples of partial wave decomposition for n > 3 can be 
found, for instance, in [Ba 4], [CH 1], [Ex 2]. 


Problems 


1. The conclusions of Example 11.1.6 remain valid even if we also consider the other 
degrees of freedom of the two particles. 


2. A family S c B(€™) is irreducible provided it contains at least n noncommuting 
Hermitean operators. In the case n > 3 this condition is sufficient but not necessary; 
what is the least number of noncommuting Hermitean operators that an irreducible 
S must contain? 

Hint: Consider the operators 51,93 of Example 7.5.4b. 


8 Let H=H, @Hp2. If a sequence (B} Cc B(H,) converges weakly to an operator 
By, then w-limp.0B) = B,. 


4. Let H= HF) @ He. Prove: 


(a) if C; € Jp(H;), j = 1,2, for some p > 1, then C; @ Cy € J,(H) and 
!C1 ® Callp = [ICallp Cillp » 


(a) if the operators C; are of the trace class and B; € B(H;), 7 = 1,2, then 
Tr (BiC; @ BeC2) = Tr(B1iC;) Tr (BeC2), 


(c) the tensor product of statistical operators is a statistical operator. 


5. Prove: (a) Let the state W of the composite system be pure and W = W, @ W2, 
where the W; are realizable pure states of the subsystems; then the reduced 
states are also pure and satisfy W;(W) = W;, j =1,2. 


(b) Suppose that the state of the composite system is described by the statistical 
operator W = )>,w;E"), where {E%)} is a sequence of one-dimensional 
projections; then the reduced states are W;(W) = 3°; wjW;(E) (strong 
convergence). 


6. Suppose that { E,: a € J} is the projection family corresponding to the coherent 
subspaces in H and W € J(H) is a statistical operator; then the operator 


W := }> EaWE, (strong convergence) 
aeJ 


is again statistical, it is reduced by all E,, and Tr(AW) = Tr(AW) holds for any 
Aé a 7 B(Ha). If W is a projection the same is true for W , while the opposite 
implication is not valid. 
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7. Derive the reduction formulas for a system consisting of n coherent subsystems. 


8 Let H;: IR— B(H;), j =1,2, be strongly continuous Hermitean—valued functions; 


10. 


iULe 


12. 


13. 


then the unitary propagator corresponding to H(t) = H(t) + Ho(t) satisfies for 
all t,s € IR the relation U(t,s) = U,(t,s) @ Uo(t,s), where the U;(t,s) are the 
unitary propagators referring to the Hamiltonians H;(t) . 

Hint: It is sufficient to check that the operators coincide on 7 x He ; use the Dyson 
expansion. 


Check the computations of Example 10.3.3. 


(a) Find U(t) using Example 5.2.3. 


(b) Show that {U(t) : t € IR} is a one-parameter unitary group, and compute 
its generator. 


(c) Find the reduced states. 


Prove: (a) The operator (11.7) is well defined, i.e., ~ =0 implies Upw = 0, its 
continuous extension is unitary, and UZ =. 


(b) Generalize these conclusions to the operators U(p) defined by (11.9). 


Let W bea state of n identical particles, and denote W, :=U(p)WU(p)~! for 
any permutation p € S,, where the operators U(p) are defined by (11.9); then the 
reduced states satisfy W;(W) = W,,(Wp), j =1,...,n, where r:= p™! 


Consider the Hamiltonian of two nonrelativistic spinless particles with the potential 
(11.10) and suppose that the operator H,.; is e.s.a. . 

(a) Prove the relation (11.12). 

(b) Find the spectrum of H. 


(c) Given a spectral representation of H,e, construct that of H. 


Hint: (b) Use Example 5.7.5. 


Consider the Hamiltonian Ho := ye i@ny)"P? describing a system of N nonin- 
teracting spinless particles. 


(a) Set U: aha in) =a Xyercoengnmts) omit se! Syl nmagay, 
where M := ppd "1m, is the total mass of the system and y; are the atomic 
coordinates (11. 13); then 


N-1 N-1j 
U- Ho = genet ye oe 
=1 N jal C=) 


at pat 
> Pet et, 


where the P: on k = 1,2,3, are the momentum components corresponding 
to the atomic coordinates of the j-th particle, P?*.P% := Sp, Paps and 


oo Somer -1 
Hh; i=M;, +My - 
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(b) Introducing the analogous substitution operator for the Jacobi coordinates 
(11.14), we have 


1 Nesey 
UH =i Dames) 
2M » 2; 7 


where the momentum component operators PH correspond to the Jacobi co- 
ordinates of the j-th particle and py = mel, + (SoLyeni). 


14. Let 7, Y, 01,...,On—2 be the generalized spherical coordinates in R®” introduced in 
Remark 11.5.4. Prove: 


(a) The measure 2 on S”~! given by dQ := TTjai sin? 6;dyd0,...dOn—2 is 
invariant with respect to the rotations of the sphere around the k-th axis, 
| ee 


(b) The map U, : L?(IR") > L?(IR*+ x S"~1,drdQ) defined by 
(Un) Os Y,; 61, OR TaC) On—2) =a r—1)/2 (hoon) (y, Y, 1, cee »On-2) 


is an isometry of the two spaces. 
(c) Check the relation (11.21). 


15. Prove: (a) Let U be the operator (11.16). Show that for any f € Cf°UR*t\{0}) and 
Yim there is a function y € SUR’) such that (Uw)(r,6,) = f(r) Yim(@,~) - 


(b) Check that the subspace C§°(IR*+ \ {0}) is dense in L?(IR+,r%dr) for any 
a € R, and similarly, C§°(IR" \ M) is dense in L?(JR") for a positive integer 
n provided M is an at most countable set without accumulation points. 


16. Consider the two-electron system of Example 11.2.2 and denote by a = 5ok the 
spin-component operators of the j-th electron. 


(auleb Spx al + se) be the components of the total spin; find the spectral 
decomposition of the operator S? := >}_, S?. 


(b) Construct the corresponding representation of the rotation group. 


Hint: Cf. Example 10.2.4. 


Chapter 12 


The second quantization 


12.1 Fock spaces 


Now we want to show how some results from the preceding chapter can be extended 
to situations, where the number of particles is not preserved. The first question is 
what kind of a state space can be associated with such a system. We start from the 
one-particle Hilbert space H and again denote H™ := H@---@4H;; in addition, 
we set H) := (. The direct sum of these spaces, 


F(H) = on™, 


n=0 


is then called the Fock space over H. Hence if the norm and inner product 
in H™ are indexed by n, the elements of F(H) are sequences V = {1),}%, 
with , € H™ such that 25 |lvnll2 < oo ; the inner product in F(H) is 
correspondingly given by 


(6,U)p = 3 (Pay Yn)n- 


n=0 


Suppose that # is the state space of a particle; then it is obvious from postulate 
(Q6a) that pure states of the system of these particles may be described by vectors 
from F(H) ; if the number of particles equals n, they belong to the subspace 
{WU = {yx}: vp =0 for k #n}, which we shall for the sake of simplicity denote 
also as H”). The one-dimensional subspace H) then refers to the state with zero 
number of particles which is called a vacuum; we shall use the symbol 0p for the 
corresponding vector {1,0,0,...}. 

Since we are considering systems of identical particles, not every vector of F(H) 
can be associated with an admissible state. We know that the subspaces S,,H™ 
and A,H™) in H™ are ascribed to systems of n identical bosons and fermions, 
respectively. This concerns systems with n > 2, of course, because it makes no sense 
to speak about the interchange of particles for n = 0,1 and we set S, = An =I in 
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these cases. Now we can define the symmetric Fock space as 


FAH) := 32° S,H™ 


n=0 


and the antisymmetric Fock space as 


Fa(H) := Si AnH™ ; 


n=0 


these play the role of state space if the particles under considerations are bosons 
and fermions, respectively. 


12.1.1 Remark (about notation): The symbols (-,-) and |] - || will be reserved for 
the inner product and the norm in #H ; this space and quantities related to it will 
be characterized by the adjective one-particle; similarly the space H , its vectors, 
operators on it, etc., will be denoted as n-particle. The inner product and the 
norm in H™ and F(H) will be indexed by n and F, respectively. It is also 
useful to introduce the symbol P, as the common notation for the two projections, 
the symmetrizer S, and antisymmetrizer A, ; all the formulas in which it appears 
are valid in both the symmetric and antisymmetric case. In the same way, we 
write F,(H) instead of F,(H) and F.(H), use the symbol {” for the n-particle 
subspace in F,(H) , etc. 


12.1.2 Example: If the one-particle space is H := L?(JR*), then we have H = 
L?UR°") by Example 2.4.5. The Fock space F(L?(JR*)) consists of the sequences 
UV = {Wp}29 of (equivalence classes of) functions wy, : IR°" — C such that 


Pl eho + Do faa, . . . saa) PChary . . Tele erSe 
=i IR3n 


Let us see what the subspace F,(L?(JR°)) looks like. We take any orthonormal basis 
in L*(JR*) and construct an orthonormal basis in L?(JR°") by Proposition 2.4.4b; 
using it together with the definition relation (11.9), we readily find that 


ig a wis nie In) = Yn(Zpiy--->Zpq) 


holds for any permutation p € S, ae. in IR. On the other hand, any y, € 
SpL’ UR") satisfies the relations U(p~!)tn = U(p7!)Sptn = SnUn = Un, 80 we 
have 

Un(Zp,, tee » pp) = Un(21, vee ; Zn) 


a.e. in IR®”; hence the subspace S,,L?(JR°) consists of functions which, up to a zero- 
measure set, are symmetric with respect to any permutation of variables. Similarly, 
Fa(L7(IR*)) consists of sequences of functions which are a.e. antisymmetric with 
respect to permutations of particle positions, 


Vn(Zp,,--- »Lp,_) = pn(21,...,2n). 
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The same results are also valid for other dimensions of the configuration space, i.e., 
for H := L*(JR*) with any positive integer d. 


Let T be a densely defined operator on H ; we shall describe a standard proce- 
dure by which we can construct to it an operator on F,(H). As earlier, the symbol 
TI; will denote the tensor product [@---@1@®T@I®---@I in which the operator 
T stands at the j-th place. For any n > 1 we define the operators 


T= >T;, T= Te---@T 
j=l 


whose domain is D,,(T) := H™ if T is bounded and D,(T) := D(T)x---xD(T) 
otherwise; it is obvious that they are densely defined. In particular, T* = T™=T, 
and we also set T? := 0 and Tj! := J. Then we are able to define the following 
operators on F(H), 


te CG) : TAG) {~n}n=o = 1G Os Un tnao 


T'(T) : T"(T){da}o = {Tr Yn} ro 
with the common domain formed by the subspace 


DT) = { Uy = {dp,...,¥n,0,..-} : Yn € Da(T), N =0,1,...}. 


12.1.3 Proposition: The operators T“(T) and T"(T') are densely defined. Each 
of them is reduced by the subspaces ¥,(#), and their parts in these subspaces, 
which we will be denoted by 7*(T) and EA), respectively, are also densely 
defined. 
Proof: It follows from the definition of the norm ||-||- that toany V = {y,}%9 and 
€ >0 thereisan N such that ||W-Yy||p < de holds for Vy := {yo,..., Wy, 0,...}. 
Using the fact that D,(T) = H™ , we readily find that D(T’) is dense in F(H). 
Next we shall check the assertion about the reduction. Let P the projection 
onto the (anti)symmetric subspace, PY = {Prtn}?2,, and consider an arbitrary 
n-tuple of vectors f; € D(T). We have fp,®-:-@fp, € Dn(T) for any permutation 
p € Sp ; due to Problem 11.10b this means U(p)D,(T) Cc D,(T), and therefore 
also P,D,(T) C D,(T). The definition of D(T) then yields PW € D(T) for any 
We DT), ie, PD(T) c D(T). Let {f;}, {9;} be n—tuples of vectors from D(T) 
and p€ Sn. Denoting r := p-’, we may write 


(91 @-** @ In; U(p)T2 (fr @°++@ fa))n 


= ¥ (Gr, B+ @ Gray fr B+ OTF; @+++®@ fan 


j=l 
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= 5 ey T hi) I] (Gras fe) = 2 (oT fa I Cr =) 
j=l kAj 


= (9, 8-+-@ In; T2U(p)(fi @ +++ @ fa))n- 


The condition D(T) = is in combination with Proposition 2.4.4a then implies 
U(p)Tn( fi © -*: ® fn) = TaU (p)(fi @--- @ fn) for any f; €¢ D(T). This identity 
extends fre to DAT ” and since it is valid for all p € S,, we get PaT™ Un = 
T= Patty for any pn € D,(T). 

Now we take a vector Vy = {o,...,¢w,0,...} € D(T). The inclusion 
PD(T) C D(T) means that PUy = {Poo,..., Puww,0,...} belongs to D(T) 
and 

PT®(T)Wy = {PaTE Yn} 29 = {TE Paa}So = TPT) PUN, 


i.e., the operator T*(T) is reduced by the subspaces ¥,(H). The condition D(T) = 
F(H) implies that the subspace PD(T) is dense in F,(H), so 7;*(T) is densely 
defined. The proof for the operator Le ) is similar; we leave it to the reader 
(Problem 2). & 


The operator T*(T) is called the second quantization of the one-particle 
operator T. The same name is used for its part in the subspace F,(H) ; we 
are, of course, interested in one of these subspaces only, depending on whether the 
particles under consideration are bosons or fermions. Such operators are used to 
describe some observables for systems of identical particles provided the operator 
Ar) is e.s.a. (in that case, its closure is often denoted as dI(T) ; the origin of 
this notation is explained in the notes). 


12.1.4 Theorem: Let A be a self-adjoint operator on  ; then the operators 
T*(A) and T>(A) are e.s.a. 

Proof: According to the definition given in the notes to Section 4.4, the operator 
T*(A) is a restriction of the direct sum TJa(A) := @229 An ; both operators are 
obviously symmetric. Given ® € D(Tg(A)), we consider ‘ee truncated vectors 
Oy := {¢o,...,¢n,...}. Each of them belongs to D(T*(A)) and the sequence 
{Py }¥_1 is convergent; this follows from the fact that S7%o || Andnll2 is finite by 
assumption and 


M 
DL lAngall, = IIT*(A)(Ou— Oy) |?- 
n=N+1 
In this way, we obtain the inclusion J,(A) C T*(A), which implies in turn that 
the closures of the two operators coincide; however, Tg is e.s.a. due to Theo- 
rem 5.7.2 and Problem 4.29b. The essential self—adjointness of T,*(A) follows from 
Proposition 3 and Problem 4.24b. J 


12.1.5 Example: Consider the simplest nontrivial case: the second quantization 
of the unit operator I. The operator T*(I) is e.s.a. by the preceding theorem, 
and o(N) c {0,1,2,...} holds for N := T®(I) (cf. Problem 3). We shall 
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show that this inclusion is in fact an identity. The operators [7 are bounded and 
satisfy I>(f; ®---@ fn) = n(f, @--- @ fa) for any fi,...,fn € H™, 80 we 
obtain J* = nI,,, where I, is the unit operator on H™). Then the self-adjoint 
operator N has a pure point spectrum and the n-particle subspaces H™) are 
the corresponding eigenspaces. Moreover, Dn(I) = H™ are obviously invariant 
with respect to the operators U(p), and therefore also with respect to P,. The 
operators aol ) then act on the subspaces Hi) as multiples of the unit operator; 
their closures AN, := 7,*(I) have pure point spectra, o(Np) = {0,1,2,...}, with 
the eigenspace 1 corresponding to the eigenvalue n. It is natural to call N the 
operator of the number of particles; the same name is used for MN, and also for their 
€.§.a. restrictions. 


Theorem 4 allows us to associate with a one-particle observable A the observ- 
able 7*(A) of the many—particle system under consideration; the index assumes 
the value p= s,a according to the type of the particles. Moreover, it is clear from 
the construction of the second—quantized operator that A and 7(A) describe the 
same physical quantity, at least for observables of additive character such as energy, 
momentum, etc. For instance, if H is the one-particle Hamiltonian, then 7(H) 
is the Hamiltonian for the noninteracting many-particle system. 

This correspondence between the observables induces a simple relation between 
the unitary groups they generate, in particular, between the evolution operators of 
one-particle and many-—particle systems. 


12.1.6 Theorem: Suppose that {U(t): t € IR} is a continuous unitary group 
on H generated by a self-adjoint operator A. Then {7"(U(t)) : t € R} is 
a continuous unitary group on the Fock space F(H) generated by 7*(A), and 
similarly, {7,"(U(t)) : t € IR} is a continuous unitary group on F,(H) generated 
by 7,7(A). 

Proof: The reader is asked to check that {7™(U(t)) : t € R} is a continu- 
ous unitary group (Problem 4a). We denote its generator as A. It exists due to 
the Stone theorem and its domain consists of the vectors Y € F(H) for which 
lim,_.o[T"(U(t))¥ — V]t-! exists. It follows from Proposition 5.9.6 that {UN (Et) : 
t € IR} is generated by A™ := AX ; in addition, AY = A and A = 0 obvi- 
ously holds. We take any vector W E DA), veg h aaion: orgy, 0,...} with 
Yn € Dn{A) ; then 


lim : (TIT yw - W) -i7(A)w [. 
2 
= > jim ( (UR(t) -1) Yn—tAndn|| = 0, 


so T™(A) c A. This implies T*(A) C A, and since both operators are self-adjoint 
they equal each other. The remaining assertion can be proved in an similar way by 
restriction to vectors VE D(A)NF,(H). Ht 
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12.2 Creation and annihilation operators 


In the following we shall assume that the one-particle state space 1 is anata 
Let € = {$;}52, be an orthonormal basis in it; then the vectors $j, ®@ --- ® j, 
with j, = 1,2,... and k =1,...,n form an orthonormal basis in H™ which we 
denote as E,. Unfortunately’ he latter is not very suitable, since we are interested 
primarily in the subspaces P,+) and some vectors of €, have nonzero orthogonal 
components both in P,H™ and in its orthogonal complement. Hence we shall use 
E, to construct another orthonormal basis. 

Any vector of €, is uniquely determined by an n-tuple of positive integers 
7 = {hi,.-+,jn} ; we shall call 7 a variation and denote the corresponding vector 
as ¢n(j). The identity 


U(p)bnlj) = bj, @°*: @9;,, =: Gn(Jop) 


obviously holds for any permutation p € S, ; it is not excluded that the operator 
U(p) maps some basis vectors onto itself, because in general some of the numbers 
jx may coincide. To make clear how the permutations act on the elements of €, , we 
introduce the symbol 7 for nondecreasing variations, i.e., such that 7; <---< jn. 
This ordering makes it possible to define uniquely the numbers of repeated indices, 


Az =In < Jatt = °° = Initna a. < Jaytetnmitl = 9° = nj 


we have, of course, ))j=;7i = n. With the help of permutations, a nondecreasing 
variation 7 gives rise to the subspace Hn(j) := {¢n(Jop) : p € Sn }un ; it is not 
difficult to check that 


n! 


dim H,(j) = a =: c(j). 


Furthermore, the projections P, preserve this subspace, P,aHn(j) C Hn(j) ; we 
shall find the part of FP, in H,(j). Consider the vector 


bn(9) = (Jop) , 


Vn! se. 


where c, is a normalization constant which can be found easily: the sum has n! 
terms, among which just c(j) vectors are mutually different. They form an ortho- 


iis set and each of them is contained in the sum n,!...nm! times; it follows 
that 


P e ~ ean! 
len)? = — (ra! ta!)?e(9) = —E 
n! c{j) 
so we have to choose c, = ) = (n!...mm!)~'/* to make ¢$(j) a unit vector. 


Next we express the action of the symmetrizer Sy on $n(jop) , 


Snobn(jop) = >> dn((jop)op’) = eS dn(jop”) = on()) 
| eee n! p"ESn TN. Cg 
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for any p € S, ; hence by definition of H,»(j) we obtain 
SrHn(j) = {On(J) bin ’ 


t.e., Sa} H,(J) is the projection onto the subspace spanned by the vector ¢5(J) . 

Similar conclusions can be made for the antisymmetrizer A,. First of all, it is 
obvious that A,Hn(j) = 0 if a number 7, repeats in the variation 7. If 7 is a 
variation without repetition, the same argument as above yields 


AnHn(9) = {$n(J) }iin ; 


where the unit vector ¢°(j) is defined by 


6D) = = DO eobnl er). 


* pESn 


The subspaces H,,(7) are mutually orthogonal and it follows from the construction 
that {En}in = DP? Hn(j) C H™ ; however, €, is by assumption total in H™ , and 
therefore 


yO Ho 
a 
Combining this with the above results, we get 


PyH™ = 30° PoHnl§) = 2° {4 iin 


j 


where we put ¢2(j) = 0 if some number is repeated in 7. This relation shows 
that the vectors $3(j), with 7 = {j1,...,jn} being an arbitrary nondecreasing 
variation, form an orthonormal basis in S,H'” , and similarly the ¢2(j), where 7 
is any nondecreasing variation without repetition, form a basis in A,H™ . 

Given a variation j, we can associate with it the sequence {n;}%, where n; 
denotes the number of elements of j7 equal to 1. This fact may be understood so 
that the state ¢; occurs in ¢n(j) (is occuppied) just n; times; this is why {n,} is 
referred to as the occupation-number sequence (or briefly, ON sequence). It has m 
nonzero terms n;,, 1<k<m. Clearly, m <n and 


oo m 

on — Ym, = 15 

t=1 k=1 
the family of ON sequences which satisfy this condition will be denoted as O%. 
It is easy to check that the map j +» {n;}%, is a bijection between the set of 
all nondecreasing variations and ©%, so we may identify them in the following, 
5 = {n}S, ; for the sake of simplicity we shall write @,(7) =: dn{ni}, etc. Ina 
similar way we find that the subset of variations without repetition corresponds to 
the family O2 C O% of all sequences {n;}%2, composed of the numbers 0 and 1. 


Let us summarize the results. 
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12.2.1 Proposition: The set €? := {¢?{ni} : {ni} € O2} is an orthonormal 
basis in P,H™ for any n> 2. 


The vectors ¢2{n;} can be used to construct an orthonormal basis in F,(H) 
which we denote as €,. According to Problem 2.15, it consists of the vectors ®2 {ni} 
with n=0,1,... and {nj} € OF defined by 


{Goes} 6 on=0 
@?{n:} = ¢ {0,¢1{ns},0,...} oe 
{0,...,Oy@2{naeO,...} ... mea 

where ¢:{n:} := $; for that j for which nj; = 1 and the nonzero component 


stands at the n-th position. Each basis vector is thus determined by a number n 
and some ON sequence {n;} € OF ; owing to this €, is called the occupation— 
number basis. 

After this preliminary let us pass to the subject indicated in the title. We 
associate with arbitrary fi,...,fn € A an element of F,(7H) defined by WV? := 
{0,...,0, Pa(fi@---@ fn), 0... }. The set of such vectors will be denoted as Dw”) ; 
for n= 0 we set Do := {V5}, where Wf := Q. We also introduce the set 


oo 
= { U py | 
n=0 lin 


which is dense in F,(#1) because it contains the linear envelope of the basis €,. For 
any f € H we define the creation operator a*(f) and annihilation operator 
a(f) as the maps D, +> D, obtained by linear extension of the relations 


a SE tin sine a vn +1 Wrarlf, JS umass fn), 
(12.1) 
aye a h,..-, = ALE i(f, 1s (Coe fe ee Te 


where 6”) = 1 for p=s and 6”) = (-1)* for p=a. The second of the relations 
(12.1) holds for n > 1, while for n = 0 we have 


a(f)N% = 0. (12.2) 


Notice that the map f + a*(f) is by definition linear while f + a(f) is antilinear. 
Using the definition of a(f) we are able to express any vector of DY) in the form 


W(fiyeeyfn) = 5 2"(fi)---0"(fa) Mo. (12.3) 


Correctness of the definition requires that the relations (12.1) define a linear mapping 
on DM), i.e., that 


a®(f)PR(Si,--- afetGe,---. fa) Sea (POE(..., fay...) eG gn, -.-) 
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holds for any fi,...,fn,9e € H and a € € ; for the sake of brevity, we have 
introduced here the symbol a*(f) as a common notation for a*(f) and a(f). The 
linearity can be readily checked, and therefore it is sufficient to know how the op- 
erators a*(f) act on vectors of a suitable linearly independent subset in D, , for 
instance, the occupation—-number basis constructed above. To be able to rewrite the 
relations (12.1) for this particular case, we define for any {n;} € O% the sequences 
{ny r= {my,...,Meo1, Me + 1,Me41,...} and the numbers s, := D*=} n,;. Nat- 
urally, the sequence {ni} O is defined only if n, > 1. If f is some vector of the 
one-particle basis {¢,}52, , then the relations (12.1) in the antisymmetric case yield 


ll 


a* (x) B2 {ni} = (—1)*(1 — ne) 2, {na}, 


(12.4) 
a(d) B{n} = (—I)**ne Be_afma} 


(Problem 6); we take into account only ON sequences {n;} € O%. In the symmetric 
case we similarly obtain 


a* (dn) {ns} = Ve +18, ,{ni}h?, 
(12.5) 
alg.) Bi{ns} = Vre BE_ {ni}. 


If f is a general vector in the one-particle space we use the fact that the (anti)linear 
dependence of the operators a*(f) on f may be extended to “infinite linear com- 
binations”: we have 


a(fU = Yn) a'(é)V, af = el br)a(d,)Y (12.6) 


for any V € D, where the series converge with respect to the norm in F,(H) — cf. 
Problem 7. It is obvious from the definition (12.1) that the operators a*(f) map 
the space D, onto itself; this means that polynomials in creation and annihilation 
operators are well defined. An important role among them is played by 


_ {[ a*(f)a*(g) — a*(g)a*(f) . pes 
[a* (f),a*(g)]p = a*(f)a*(g) + a*(g)a*(f) ae p=a 


i.e., by the commutator and anticommutator in the symmetric and antisymmetric 
cases, respectively; the latter is often alternatively denoted as [a*(f),a*(g)]4. 


12.2.2 Theorem: Creation and annihilation operators satisfy the relations 


la(f),a(9)}phY = [a°(f),a°(g)|p¥ = 0 


(12.7) 


[a(f),a°(9)pY = (f,g)¥ 
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for any f,g € H and W € D, ; moreover, a*(f) = a(f)*}Dp and at y= 
a*(f)*)D,. In the symmetric case the operators a#(f) are unbounded for any 

nonzero f € H, while for p=a they are bounded and ||a*(f)|| = || fl. 

Proof: The subspace D, is dense in F,(7) , so the adjoint operators a*(f)* exist. 

Let n > 2; the definition relation (12.1) together with Problem 8 gives 


(WF (91, see Gedy FUE bas oaeu* fale 


= 7 SP) (Gers (Gras £2) «++ (Gras fn) 


*  r€Sn 


“—o oP (ge, f)(VR-1 (91;- ++) Gk—1) Jk+1)+>- on fs, -- . iJn))F 


= (a(f)VR(g1,---19n), Uni (far---> fn) F 


for arbitrary fo,..-, fns91,)-++;9n € H; we have introduced here the symbol v4?) , 
which is v®) = 1 for p=s and v) =«, if p=a. It is straightforward to check 
that this relation holds for n = 1 as well. Inner—product linearity then implies 


a4 Gi: ->9n),0°(f)V)r = (a(f)¥2(91,---59n), VU) F 


for any VE Ooo) ue. Furthermore, (V,a*(f)®)r =0 for Ve em; Pe pm” 
unless n—m = 1. We infer that a*(f) = a(f)*} D, ; using the identitities 


(UW, a(f)®)r = (V,a°(f)"O)r = (a°(f)¥,®)r, 


which are valid for any VY, @ € D,, we also prove a(f) = a*(f)*) Dp. 

As for the relations (12.7), it is sufficient to check them for ¥ € D™”) . The iden- 
tity [a*(f),a*(g)|>¥ = 0 follows from the definition (12.1) and W?(fp,,..-, fo) = 
vEWE(fi,---,fn) ; the first line is completed with the help of Problem 9. To prove 
the second relation, we use (12.1) to express 


a(f)a"(g)Uh(fi,-++s fr) 
= (f,9) Wahi tn +¥ Pr Fah. -subasdhaifagtenss fo) 


f=2 
= (f,9) Ualfis---> fn) + a*(g)a(f) UA(fi,---s fn); 


where the upper and lower sign correspond to p= s and p=a, respectively. 

Consider now the symmetric case; in view of the (anti)linearity we need to prove 
that the operators a*(f) are unbounded for each unit vector f € H. This is in 
fact easy: the vector W%(f,..., f) also has a unit norm, so choosing the basis {;} 
in such a way that $1 = f , we obtain from (12.5) |la*(f)¥s(f,..., fle = Mn+ 1 
and |la(f)¥;(f,..-, file = Jn. 
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In the antisymmetric case we first take the operators a*(¢,) corresponding to 
some vector of the basis {¢,;}32,. The vectors ®¢{n;} and ®%,{m,} corresponding 
to different ON sequences {n;} and {m,}, respectively, are mutually orthogonal; 
hence it follows from (12.4) that also a*(¢,)®%{n;} 1 a*(d,)®2, {mi}. This yields 


for a linear combination W := aj 04 {ne )} the inequality 


la*(be)Vlle = Yo lasl? 
J 


* a } : j 
a" (bn) ®2, {rf $1 = D> los? — my”)? < (WF, 
uy 


which turns to identity, e.g., for VY = Q. However, vectors of this form are dense 
in ¥,(H), and therefore ||a*(¢,)|| = 1. Finally, if f is an arbitrary nonzero vector 
we choose the basis {¢;} so that ¢, = f/||f|| ; the linearity of a*(-) together with 
the already proven property of the adjoint gives ||a(f)|| = |la(f)*l| = lle*(A ||. 


The importance of creation and annihilation operators stems mainly from the 
fact that we are able to express in terms of them other operators acting on the 
space F,(7#1) — see the notes. The relations (12.7) are called the canonical com- 
mutation (anticommutation) relations in the symmetric and antisymmetric 
situation, respectively. An investigation of them is more difficult in the former case 
due to the unboundedness of the operators a*(f). This difficulty can be removed 
by replacing (12.7) by suitable relations for bounded functions of a*(f) in analogy 
with Section 8.2; this will be done in the following section. 


12.3. Systems of noninteracting particles 


Second-—quantization formalism has numerous applications but we restrict ourselves 
to discussion of two simple examples; the reader can find much more information in 
the literature quoted in the notes. 

To describe a particular system, we have to specify the one-particle state space 
H, the operators representing one-particle observables, among them the Hamil- 
tonian in the first place, and the interaction between the particles. The simplest 
situation occurs when the particles do not interact. In accordance with what we have 
said in Section 11.3 about noninteracting systems of a finite number of particles, we 
assume that the operator 7,“(H) corresponding to the one-particle Hamiltonian 
H plays the role of the total-energy operator. Theorem 12.1.6 then claims that 


exp (-172>(H)t) = File") (12.8) 


holds for any t € JR; in other words, that the propagator of the free-particle system 
is obtained as the second quantization of the one-particle propagator. 

The first system we are going to treat is a free scalar quantum field. As a system 
of spin-zero particles, it is associated with the symmetric Fock space F,(H) , where 
the one-particle state space H will be specified below. Before doing that, however, 
we have to analyze the relations (12.7) in more detail. In order to draw the line 
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between them and the canonical commutation relations of Section 8.2 we define for 
any f € H the Segal field operator as 


6s(f) == 7 (a(f) +a*(f)) 
with the domain D, c F,(H). 


12.3.1 Theorem: (a) ®s(f) is e.s.a. for any f eH, 


(b) The set {s(f1)...Bs(fn)%: f7 € H, n =0,1,...} is total in F,(7) ; in 
other words, the vacuum vector is cyclic for the algebra generated by J, and 
the field operators ®g(f) for all f eH, 


(c) [®s(f), ®s(g)]¥ =iIm(f,g) YU holds for any f,ge H and VEDs. 


12.3.2 Remark: In addition to field operators, quantum field theories use canoni- 
cally conjugate momenta. In the present context, these are defined by 


I(f) := @s(éf) == a (a*(f) — a(f)) 


(see the notes; one has to keep in mind that there is no direct relation between 
II(f) and the momentum observable of a given field). Given an orthonormal basis 
{f;}52, in H, assertion (c) of the theorem can rewritten for ®(f;) := ®s(f;) and 
Ue D, in the form 


[Os(fj),Os( fe) Y = [Is(fj), Us(f)]Y = 0, [Os(fj), Us(fe))Y = ib, 
which is obviously analogous to (8.6). 


Proof of Theorem 1: It follows from Theorem 12.2.2 and Proposition 4.1.2d that 
;(f) is symmetric, so it is sufficient to find a dense set of analytic vectors for it. 
The operator ®s(f)™ is a polynomial in a*(f) for any positive integer m; hence 
each W € D, belongs to D(®s(f)™). The subspace D, contains vectors describing 
states with a finite number of particles, z.e., for any YW € D, there is a positive 
n such that Y € @f_, HS). Moreover, it is obvious that @s(f) maps H{*) to 
H&-) @ H+) | Problem 10b then yields the estimate 


[Ps(™Vlle < 22 (rt mln m— 1). (+1) elif, 
which implies 
_ — @s(/)"Wlie < le y (eon Wasa 


for any t > 0. The set D, , which is dense by Proposition 12.1.3, therefore consists 
of analytic vectors. Assertion (b) follows from the mentioned proposition together 
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with (12.2) and (12.3), and finally, the commutation relations of assertion (c) are 
obtained by a simple computation from Theorem 12.2.2. [ff 


The first assertion of the just proved theorem allows us to cast the canonical 
commutation relations into an alternative form. For any f € H we define the 
unitary operator 

W(f) = exp (i®s(F)) ; (12.9) 
we shall show that it represents a counterpart to the Weyl operator of Section 8.2. 


12.3.3 Theorem: The operators (12.9) satisfy for any pair f,g € H the relation 
W(f)W(g) = e mW (Ff +9). (12.10) 


The map f + W(f) is strongly continuous and the set {W(f): f € H} is 
irreducible. 

Proof: As for the strong continuity of W(-) and the relation (12.10) we refer to 
the literature mentioned in the notes. The irreducibility is a consequence of the 
following result: 


12.3.4 Lemma: Let {¢;} be an arbitrary orthonormal basis in # ; then the set 
{ W (td;), W(itd;) : 7 = 1,2,..., t € IR} is irreducible. 

Proof: Since W(—f) = W*(f) holds by definition, the set under consideration is 
symmetric, and in view of Schur’s lemma it is sufficient to check that any bounded 
operator B on F,(H) such that [B,W(tf;)] = 0 is valid for f; = ¢; or f; = 
ip; equals a multiple of the unit operator. Differentiating this relation, we find 
B@s(f;) C ®s(f;)B. We introduce 


N(f) = 5 (@s)' + B50) Isl?) 


for any f € H; the reader is asked to check simple properties of these operators 
in Problem 12. Now take a vector ¥ = {v%o,%1,..-} € MN; D(N(¢;)) such that a 
finite limit n(V) := limm.. DZ, (VY, N(¢i)¥)r exists; since the operators N(f) 
map each H™ onto itself, the inner product on the right side can be written as 
limpoo Wteo (We, Nu(Gi)We)e, where Ne ($i) == N(di) /H , and therefore 


co dim, jim, > dX (this Ne (Pie )e - 
The operators N(¢;) are positive by Problem 12b, so the double series may be 
rearranged; this yields n(V) = limpseo liMmoo (Un, VE1 N(Gi) Un) p with U, = 
{uo,..-;WUn,0,...}. However, any such vector belongs to D(N,) due to Prob- 
lem 10a and N,Vn = limmooo DZ, N (Gi) Un ; hence 


n(W) = Jim (Wn, NeWn) = 3 kiledall2 
k=0 
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Using the fact that the vacuum vector is contained in (; D(N(@;)) together with 
Problem 12a, we find that BQ € 1, D(N(¢;)). Furthermore, Problem 12b gives 
N($;)BQ% = BN(¢;)BQ% = 0, so n(BQo) exists and equals zero; the identity 
displayed above then implies BQ = aN for some a € C'. Finally, a*(f) = 
2-1/2(bs(f) — i@(if)) , and therefore 


BY? (i, - e » Pin) = aW?,(di,,- a Pin) 


follows from the relations B®s(f;) C ®s(f;)B and (12.3). This identity extends 
linearly to the subspace D,, and since the latter is dense in F,(H) and B is 
bounded, we obtain B=al,. § 


The relations (12.10) imply, in particular, that the operators (12.9) satisfy for 
any pair f,g € H the relations 


W(f)W(g) = "FW (g)W(f). (12.11) 


Given an orthonormal basis {¢;} C #, we may define the continuous one-parameter 
unitary groups U;(t) := W(it@;) and V;i.(s) == W(s@;) ; then 


U;(t)Vi(s) = eV, (5)U;(t) 


holds for all s,t € IR and j,k = 1,2,.... Alternatively, we may introduce the 
real Hilbert space S as the closure of the real linear envelope of {¢;} and define 
unitary—operator valued maps S — F,(H) by U(f) :=W(if) and V(g) := W(g). 
The relations (12.11) may then be rewritten in the form 


U(f)V(g) = eV (g)U(f), 


which is nothing else than (8.12). In this way, we have associated with any one- 
particle Hilbert space an irreducible (unitary, strongly continuous) representation 
of the canonical commutation relations which is usually referred to as the Fock 
representation. 

The properties of this representation depend, of course, on the choice of H. In 
the case of a quantum field the one-particle state space is infinite-dimensional and 
the Stone-von Neumann theorem is no longer valid. Our next goal is to illustrate 
this claim on the physically important case of a free relativistic field, which we shall 
introduce below. 

It is convenient for our present purpose to choose H := L?(JR%). For any 
m > 0, we define on H the operator fm: (Umf)(p) = “/m?+ p? f(p) with the 
usual domain, and the antilinear operator C : (Cf) = f(—p). We shall also need 
the inverse p7,', which belongs to B(L?(UR%)). It is clear that S(JR*) is a common 
invariant subspace of these operators; thus we shall not discriminate between them 
and their restrictions to S(IR*). We also denote Sc(IR®) := { f € SUR’): Cf = f } 
and define 


n(f) 7 ®s(u7, f) ’ ®n(if) = ®s(ipmf) (12.12) 
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for an arbitrary f € Sc(JR*), where ®5(-) is the Segal field on L?(JR*%), and 
furthermore, 
Wmn(g) == exp (i®n(9)) 


for g = f, if . Now we are able to formulate the following important assertion. 


12.3.5 Theorem: Let an operator U € B(F,(L?(JR°))) satisfy the condition 
Wm(g) = Wm(g)U for some m # m’ and all g = f,if with f € Sc(R’) ; 
then U=0. 
Proof: Suppose on the contrary that U # 0 ; without loss of generality we may 
assume that U is unitary (Problem 13). We shall first show that the vacuum is an 
eigenvector of U. For a fixed a € JR® we define the unitary operator V on L?(JR?) 
by (Vf)(p) :-= e@?f(p) (with the choice of observables given below, it represents 
the translation on a). It is easy to see that Sc(JR*) is invariant under V and the 
unitary operator T/'(V) acts as 


(TAV)F, n) Pi, TRG Soa) a coe ten) aay cies Pn) 


for any F, € S,L?(JR*"). Using the transformation properties of the Segal field 
given in Problem 11b we find ®;,(Vg) = TV) O5(9) TV), where m :=m or 
m’. It follows from the functional-calculus rules that 


WalVg) = TV)WalgTAV) 


holds for any g € L?(JR*) , in particular for g = f, if with f € Sc(JR°). Using this 
relation together with UW,,(g) = Win(g)U we readily check that W,,(g) commutes 
with the operator U7} my T'(V), and since the assumptions of Lemma 4 
are satisfied, we have 


UA TEV) “UTMV) = yl. 
for some nonzero y € (. Let F,, be the n-particle component of the vector UM ; 
then the last relation together with the explicit expression of 7,"(V) given above 
implies 
eta F(p1;--+ Pa) Yea Pi,2-4 Pn) 

which is possible with a nonzero y only if F, =0 for n=0,1,...,27¢, UQ% =~yI,. 

To complete the proof, we use the just obtained result to compute the norm 
|®mn(f)Qo||? for f € ScUR*) in two different ways. Evaluating it directly, we get 


17 MOP 4 

2 Jims Vm? + p* 
On the other hand, using the unitarity of the operator U together with the inclusion 
U®m(f) C Pa (f)U , which is obtained by differentiating the condition UWp(g) = 
Wra(g)U , we find 


[m/l = | eal!) + 2° Gf) ] % 


2 iP _ lf) 
— m = D,y( f Molle = 
[Bin f)Q0||3. = |U Pm f)Q0|l% = ll? | Pre (F)Qoll% tf at ries 
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it is obvious that the two expressions cannot coincide for all f € Sc(UR*). W 


Choose now an orthonormal basis {¢;} C L?(JR?) which is contained in Sc(IR*) ; 
such a basis can be constructed easily, e.g., using the Hermite functions (2.1). Let 
S be the real Hilbert space spanned by {¢;} ; then the relations 


Um(f) = Wmlif), Vinl9) = Wmlg), fg eS 


define for any m > 0 an irreducible (unitary, strongly continuous) representation 
of the canonical commutation relations. 


12.3.6 Corollary: The representations {Um(-), Vn(-) } corresponding to different 
values of m are nonequivalent, i.e., there is no bounded invertible operator U such 
that 

UU nlf) Ul = Um (f), UVm(g) U = Vin' (9) 


holds for all f,g € S unless m =m’. 


Proof: Suppose that the operator U exists; then it follows from the definition of the 
representations and from (12.10) that UW,,(h) = W,,/(h)U holds for all h := g+if 
with f,g € S. However, such a set is dense in L*(JR*), and we therefore find to 
any h € Sc(JR*) a sequence {en} that converges to mh € Sc(JR*). Then the 
sequence {hn}, hn := [7,€n, converges to the function h, and similarly, {umhn} 
converges to Umh. In addition, the limits of the sequences {yZ/hn} are puz'h 
as a consequence of the boundedness of the operators uz ; here again we denote 
m:=m or m’. Since the map W(-) is continuous by Theorem 3, the definition of 
Wn(-) implies 


Wia(hn) —*+ Walh), Walthr) — Walth) 


for any h € Sc(JR*). Moreover, the operator U is bounded by assumption, so the 
relation UW,,(h) = Wm(h)U should hold for all h = f,if with f € Sc(UR) ; 
however, this contradicts the preceding theorem. [§ 


Let us now return to the construction of a free quantum field. We have first 
to select the one-particle state space. The most straightforward way to do that 
is to use momentum representation. Since we have in mind relativistic particles 
of spin zero, we choose Hm := Hmno = L?(Hm,dwm), which we introduced in 
Section 10.3. The momentum components are then represented by multiplication 
operators, (P;f)(p) = p;f(p), and the free-particle Hamiltonian H := Py acts as 


(Hf) (p) = ¥m?+ p? f(p) 


for all f of the natural domain; the generators of the Lorentz-group representation 
are obtained by differentiating (10.14) for s=0. 

It is more complicated to express the wave functions in terms of the space- 
time variables z = (%o,Z). By analogy with nonrelativistic quantum mechanics we 
expect that this can be achieved by the Fourier transformation. However, in the 


12.3. SYSTEMS OF NONINTERACTING PARTICLES 419 


relativistic case we may no longer separate the time from the spatial coordinates. 
Given f € S(JR*) , we define 


F(p) = (2m)? is ellPoto-F 2) Fn) dy 


The function f differs from the Fourier transform of f on S(IR*) by the factor 
/2n and the sign in the exponent; this choice is motivated by the requirement that 
the extension of the transformation f ++ f to generalized functions of the type 
f(x) := g(£)6(t) (see the notes) should preserve the standard correspondence (7.4) 
between the coordinate and momentum representation, i.e., f(p) = §(p). 

. If a function has to describe a state of a particle of mass m in the momentum 
representation, its support must be contained in H,,. We introduce the map E,, : 
SUR*) += Hn» by Enf := f } Hm; then we are able to define 


$,,(f) thes 5(E,,Re f) as i®s5(E,,Im f) (12.13) 


for any f € S(R*), where ®g is the Segal field on F,(Hm). The map f t+ @n(f) 
is called the free scalar Hermitean field of mass m. Its time evolution is given 
by (12.8), and in a similar way we can obtain its behavior with respect to the other 
Poincaré group transformations from Proposition 10.3.3 and Problem 11b. 


12.3.7 Remark: The restriction to functions of S(JR*) is not just a matter of 
convenience. The field operators are often written in a formal analogy with (12.6) 
as 


(0) = Zz D (Hel@)a" (on) + de()a(0n)) 


or ®(@) = 2-1/2 (o (2) os @-)(2)), where ®‘*)(x) denotes the “creation” and 
“annihilation” part, respectively; {@,} C S(JR*) is an orthonormal basis with re- 
spect to an orthonormal product which need not coincide with (-,-). The expansion 
on the right side does not converge to an operator. However, on a formal level it 
yields for a*(f) the expression 


a'(f) = Dade) [baila de = f Oe) f(a) de 


and a similar relation for a(f) , where the left side is well defined. Motivated by this, 
we regard quantum fields as operator-valued distributions which associate the ope- 
rators &'+)(f) :=a*(f) and &@)(f) := a(f) with each test function f € S(R*). 
An alternative formulation uses “matrix elements” of the field operators: the map 
(V1, ®,,(-)¥2) is a tempered distribution for all ¥,,W2€D, (see the notes). 


The second-quantization formalism is not restricted to description of quantum 
fields. As another illustration, let us mention an application in statistical physics. 


12.3.8 Example: The system of a large number of nonrelativistic particles (fermi- 
ons or bosons), which are confined to a certain region M C IR® and do not interact 
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mutually, is called an ideal (respectively, Fermi or Bose) gas. The one-particle 
state space is then L?(M;C€*+), with s being the spin of the particles, and 
the one-particle Hamiltonian is usually chosen as H := H @1,, where H is a 
suitable self-adjoint extension of the symmetric operator Ho : How = —Ay with 
the domain D(Ho) = Co°(JR®) and I, means the unit operator on €7*+'. The 
number of particles is preserved in this case, so it would be sufficient to work with 
the subspace HY) in F,(H) for some fixed large number n (a typical value is 
n ~ 107%), but it is more convenient to use the Fock space. 

In the antisymmetric case it is customary to choose for the algebra of observables 
the C*-algebra A C B(F.(H)) generated by the creation and annihilation operators 
for all f € H. In the symmetric case, where the a*(f) are unbounded, we define 
A as the C*-algebra generated by the corresponding Weyl operators (12.9). An 
important role among states on the algebra A is played by the Gzbbs states 


Tr ( A eaike) 


aaa A) eae 


(12.14) 
where K, := 7,*(H — I) = T2(H) — uN, which describe the gas at thermody- 
namical equilibrium for an inverse temperature (@ and a chemical potential p. 
There is an important physical difference between the symmetric and antisym- 
metric cases. To understand it, suppose that the one-particle Hamiltonian has a 
purely discrete spectrum consisting of eigenvalues FE; < E, < --: (with repeti- 
tion according to multiplicity) which correspond to eigenvectors ¢;,¢9,... (this is 
a natural assumption as long as the region M is bounded — see the notes). It 
is easy to see that the elements of the corresponding occupation-number basis are 
eigenvectors of the second—quantized Hamiltonian, 
oo 
7 (H) Pin} = BOh{n}, EF = Ein} = >) nek (12.15) 


t=1 


(Problem 14). The energetically lowest state, in which an ideal Fermi gas consisting 
of n particles may exist, therefore corresponds to the vector ®@{1,...,1,0,...}, 
z.e., each of the n lowest states of the Hamiltonian H is occupied by one particle. 
In contrast, in an ideal Bose gas the one-particle ground state ¢, may be occupied 
by a larger number of particles, in the extreme case even by all of them; the ener- 
getically lowest state here is ®3{n,0,...}. This effect is called the Bose-Einstein 
condensation. 


Notes to Chapter 12 


Section 12.1 There are two motivations for the introduction of Fock spaces. In some 
physical systems the number of particles is not preserved in the course of time evolution; 
this is typical for processes studied in quantum field theories. On the other hand, in statis- 
tical physics we usually deal with large systems where the number of particles is preserved, 
but it is technically impossible to determine it; in that case it also is more convenient to 
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work in the Fock-space framework. Examples will be given in Section 12.3. The spaces 
F.(H) and F,(H) are sometimes called Bose—Fock and Fermi-Fock, respectively. Of the 
two possibilities discussed in Example 2, only ¥,(L?(JR°)) has a direct physical meaning, 
because the state space L?(JR*) is associated with no real fermion. 

We have mentioned that the operators TA) corresponding to a self-adjoint A 
are often denoted as dI'(A) ; similarly we put ['(A) := Z(A). The origin of this 
notation is illustrated by Theorem 6. We have I'(e*4*) = e*4!(4) ; hence I maps e*4* 1 
{ I, e*4t ett @ eit} and dl may be formally identified with the linear part of this 
map at the point t = 0 which associates A ++ {0, A, A; + Apg,...}. 

The idea of the second—quantization method was formulated in [Fo 1]; on a formal 
level it is discussed in most quantum field theory textbooks — see, e.g., [Schwe], Part II; 
[BS], Chap.II, ete. A pioneering role in the mathematically correct treatment was played 
by the papers [Coo 1] and [Seg 4]; in addition to them, there are nowadays numerous 
sources from which information about the rigorous formulation of second quantization can 
be derived, for instance, [ Ber]; [BR 2], Sec.5.2; [GJ], Chap.6; [RS 2], Sec.X.7; [Sv], 
Chaps.3 and 6; [Thi 4], Sec.1.3. 


Section 12.2 Creation and annihilation operators, and also the occupation number basis, 
are standard tools of the second—quantization method which were already introduced in 
[Fo 1]. One can employ a*(f) to express other operators on F,(H). A simple example 
is the operator of the number of particles (Problem 10a); in a similar way for the second 
quantization ie (T) of a one-particle operator T’ and an orthonormal basis {¢;}52, of 
# we obtain the formal expression 


TE(L) = Yo (bi Tx) a°(;)a(Gx) - 


jk 


The meaning of the series depends on the operator T' ; however, if we use the formula to 
compute matrix elements of the operator TE) in the basis € corresponding to {¢;}, 
the series reduces to a finite sum and no problems arise. The expression is particularly 
illustrative if T has a pure point spectrum and {¢;} is the basis of its eigenvectors. 

The same is true for the second quantization of a k-particle operator T‘*) , which 
can be constructed by a straightforward generalization of the procedure described in the 
previous section. We formally obtain the expansion 


TOT) == 6 B+ Ob TH, @ ++ @ by) 


Thiywooguhs 
saan t 


x a"(Gj,)...0"(4,)a(G1,)--- (bn); 


where the meaning of the series again depends on the operator T‘*) . Expansions of this 
type with the creation operators placed to the left of the annihilation operators are usually 
referred to as the normal form of the operator under consideration. More details on these 
problems can be found, for instance, in [Ber], Sec.I.1; [Sv], Chap.6. 


Section 12.8 For acomplex function f the operator (12.9) can be written in the form 
W(f) = exp ((@(Re f) + (Im f ))) , which illustrates that it does indeed generalize the 


Wey] operator of Sec.8.2. As in the case of a finite number of degrees of freedom, one 
can use W(f) to define coherent states on F,(H) — see [Da 1], Chap.8. The relation 
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(12.10) can be checked, e.g., by a direct computation using power-series expansions on 
the set D, of analytic vectors; this method is used in [RS 2], Sec.X.7. An alternative 
proof is given in [ BR 2], Sec.5.2. At the same places the reader can find a proof of the 
strong continuity of the map W/(-), which is based on the result of Problem 11b and the 
fact that the domain D, is a core for ®s(f). 


The Segal field operator was introduced in [Seg 4]. It corresponds to the formal 
expression of Remark 7; its advantage is that it allows an easier formulation of the CCR. 
On the other hand, from the distribution-theory point of view it is more convenient to 
have the field (-) complez linear; this motivates the definition (12.13). For properties 
of operator—valued distributions, the reader may consult, e.g., [ BLT ], Sec.3.1; [SW], 
Sec.3-1; [ Ber], Sec.I.1; [Si 2], Sec.II.1. The definition (12.13) is usually extended to 
distributions of the type for : fg,r(x) = g(Z)6(t) with g € S(UR*) ; the corresponding 
modification of the map ©,,(-) is called the field at an instant 7. Of course, we lose the 
relativistic covariance, but on the other hand some properties formulate more easily for 
fields at a fixed instant. 

The representations of Corollary 6 do not exhaust the list of nonequivalent represen- 
tations of canonical commutation (or Weyl) relations. There are other representations 
which are not of the Fock type. Such representations correspond, in particular, to all 
interacting fields. This is a consequence of the Haag theorem which, loosely speaking, 
claims the following: suppose that ®)(.) and (?)(.) are quantum fields which satisfy 
the Garding-Wightman axioms (see Sec.13.3 below) and ®"1)(-) is a free field of mass m. 
If the fields are unitarily equivalent, then {2)(-) is also a free field of mass m. 

Haag’s theorem appeared for the first time in [Haa 2]; its proof and extension can be 
found in [SW], Sec.4-5; [BLT], Sec.5.4. The exact formulation of the unitary equivalence 
uses fields at a fixed instant mentioned above; what is essential is that the equivalence 
is assumed for complex~valued test functions, i.e., for both fields and their canonically 
conjugated momenta. The main importance of the Haag theorem is that perturbative 
methods based on the interaction picture cannot be applied to interacting fields in a 
straightforward way such as is common in formal perturbative computations. Let us 
remark, on the other hand, that our ability to distinguish nonequivalent representations 
of the CCR in an actual physical experiment is limited. This problem is analogous to the 
one discussed in Section 7.4; a detailed discussion can be found in [ Haa 1). 

Quantum field theory is not a priori relativistic. An illustration of this claim is 
provided by the Lee model mentioned in the notes to Sec.9.6, which describes a field of 
light nonrelativistic “6-particles” interacting with a heavy nucleon having two internal 
states. In spite of the nonrelativistic character of the model, the number of particles in 
it may not be preserved, provided the parameters are chosen in such a way that emission 
of a §-particle is possible; this process is closely related to the Friedrichs model discussed 
in Sec.9.6. A more general discussion of nonrelativistic field theories can be found in 
[LeL 1,2]. Nevertheless, the central place in quantum field theories undoubtedly belongs 
to relativistic theories. 


We did not specify the region M in Example 8. The standard statistical—physics 
procedure is to treat the system first in a bounded M, and then to study the behavior 
of the results when M blows out; this is referred to as the thermodynamic limit. The 
boundedness of M means that the one-particle Hamiltonians considered in the example 
have purely discrete spectra; this can be checked easily for parallelepipeds, balls, etc., but 
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the result is valid for much more general regions — see Theorem 14.5.2. Notice that the 
discrete character of the spectrum is essential for the existence of the states (12.14). More 
information about ideal gases can be found, for instance, in [BR], Sec.5.3; [Thi 4] and 
[Ja], Chap.15. 


Problems 

1. The Fock space F(H) is separable iff the same is true for H. 

2. Prove Proposition 12.1.3 for the operator 7,"(T). 

3. Let A be a self-adjoint operator. Its spectrum is related to that of T™(A) by 

_ [es] femeat co 
o(T(A)) > Uo (4B) > {0} U UY POCA) x ++ x (A), 
n=0 n=1 
where PH) (t,, se sgiayteHtyt+-- +h. 
4. With the notation of Theorem 12.1.6, prove 
(a) {7"(U(t)) : t € R} is a strongly continuous group of unitary operators 
which leaves the subspaces 7‘) invariant, 
(b) the group {7,"(U(t)) : t € IR} is generated by 7,7(A). 
Hint: (a) Use Problem 5.45. 

5. Let j be a nondecreasing variation. Given p € S,, show that the number of per- 
mutations p’ € S, such that jop = jop’ is independent of p and equals n;!...nm!, 
where 7; are the numbers of repeated values in 7. 

6. Prove the relations (12.4) and (12.5). 

Hint: Check the relations for a*(¢,) ; then use Theorem 12.2.2. 

7. Prove the relation (12.6). 

Hint: Check it first for ¥ € D{” 
8 Givenamap F,: Z,x-:-x Z,—+-@, where Z,. is the set of positive integers, 
prove 
> 4) 
s YP) Fr(Jpss oR Jpn) = ye be —1 YP) Fa (jis Seis 200 Ire Ires1 cated Irn) ’ 
PESn kel res), 
where 5p ) uP ) are as in the proof of Theorem 12.2.2 and si), is the set of 
permutations of the numbers (1,...,k —1,k41,...,n). 
9. Prove (a*(f1)...a*(fn)a(g1) ---@(Gm))*} Dp = a*(9m)---0*(91)a(fn)-.-a(fi) and 


the analogous relation with the interchange of the creation and annihilation opera- 
tors. 
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10. Let Nz be the operator of the number of particles of Example 12.1.5. Prove: 


a) if {¢;}%, is an orthonormal basis in H, then NZW = S-F2, a*(d;)a(o;)¥ 
ISj=1 J 
holds for an arbitrary vector Y € Dp, where the series converges with respect 
to the norm of F,(H), 
(b) in the symmetric case, the inequalities ||a*(f)¥||r < ||f|| |(Me+1)'/7Ull} and 
1/2 
laf) Wlle < IF ll We? ¥% hold for any Y € DINs””). 


Hint: (b) Use the occupation—number basis. 
11. Check the following properties of Segal field operators: 


(a) let a sequence {f;}92, C H converge to some f € H; then %s(f;)¥ — 
®>(f)W holds for any WE D,, 


(b) TH(U) s(f) TA) = ®s(Uf) holds for any unitary operator U on the 
one-particle space 71, 

(c) the preceding assertion extends to unitary operators V : H — H’ between 
different Hilbert spaces. We define the map 7,!'(V) : F.(H) — Fs(H’) by 
TH(V)Un(fiy--+>fn) = Sntn(V fi,---,V fn) ; then the relation 


TA(V) Os(f)T(V) = O5(VS) 


is valid between the corresponding Segal field operators on the spaces F,(H) 
and F,(H’), respectively. 
Hint: (a) Use the result of the previous problem. 


12. Consider the operators N(f) from the proof of Lemma 12.3.4. Show that 


(a) if a bounded B commutes with W(tf;), then BN(¢;) C N(¢;)B, 


(b) N(f) is positive for any f € H and N(f)}D, = N(f))D, = a*(f)a(f), 
where N(f) := TAF, )f)- 


13. Show that if U satisfies the assumptions of Theorem 12.3.5 and U #0, then there 
is a € R such that aU is unitary. 
Hint: The operators U*U, UU* commute with W,, and W,,’, respectively; use 
Lemma 12.3.4. 


14. Prove the relation (12.15). 


Chapter 13 


Axiomatization of quantum 
theory 


13.1 Lattices of propositions 


In Section 7.6 we mentioned the motives to axiomatize a physical theory. Now we 
are going to discuss two axiomatic approaches to quantum theory. The first of them 
is concerned mostly with a justification of the postulates formulated above; it is 
based on a deeper analysis of the properties of yes—no experiments. 

For brevity we replace the expression “outcome of a given yes—no experiment” by 
the term proposition; the dichotomic character means that a proposition is either 
valid (true) or not valid (false). Various relations can exist between the propositions 
concerning a particular system. In general, they depend on the state of the system. 
Some of the relations are state-independent, however, and we shall be interested 
in them in the following. For instance, the inclusion A, C A, C JR? means that 
validity of the proposition “the particle has been found within A,” follows from 
the validity of “the particle has been found within A,” irrespective of the particle 
state. In such a case we say that a implies b and write symbolically a Cc b. In 
accordance with experience we adopt the following assumption: 


(pl1) the relation C defines a partial ordering on the set L of propositions 
corresponding to a physical system. 


It implies, in particular, that the relations a C b and a D b mean identity of a and 
b, z.e., that the two propositions are simultaneously either true or false. 
Furthermore, we introduce the proposition ab, which has the meaning of the 
simultaneous validity of the propositions a and 6. Mathematically it can be defined 
as the infimum (or greatest lower bound) of the set {a,b}, by which we understand 
the element aNb € £L such that c C amb holds iff cC a and c C b; in the same way 
we introduce the infimum f)\,¢5a@ for any subset S C L. Physically the meaning 
of the proposition aM 0 is clear provided the order in which the corresponding 
yes—no experiments are performed is unimportant; its validity expresses the fact 
that the positive result has been found in both of them. In general, the situation is 
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more complicated since the observables may be noncompatible, but even then the 
proposition aMb makes sense: we can identify it with the positive outcome of the 
idealized measurement consisting of the sequence {a,b,a,b,...} — see Example 3 
below and Remark 3.2.10. We shall not go into details and shall adopt the following 
postulate: 


(pl2) the infimum (yes a exists for any subset SCL. 


This implies the existence of the absurd proposition 9, which has the property that 
@c a for any a€ L; by definition 0 = Mueca. 

The third basic postulate equips the proposition set £ with the operation of 
orthocomplementation defined in the following way: 


(pl3) for any a € L, there is a proposition a’ € £ such that 


(i) (a’)'=a, 
(ii) ana’ =90, 
(iii) the relation a C b implies b’ C a’. 


In this case the interpretation is easy; the proposition a’ refers to the same yes- 
no experiment, with the results switched: if a is true, a’ is false and vice versa. 
The postulates (pll)-(pl3) may be expressed in a more concise form if we use the 
terminology of the algebraic lattice theory (see the notes): 


(pl) the family CL of propositions corresponding to any physical system forms a 
complete orthocomplemented lattice; 


for brevity we shall drop the adjectives and speak about CL as a lattice. The postu- 
lates have some simple consequences. Any subset S C L has the supremum (or least 
upper bound) Uses a, which is by definition a proposition such that b> Ugesa iff 
b> a for all ae S. The supremum and infimum are related by a de Morgan-type 
formula (Problem 1). The proposition aUb has the meaning “a or 6 is valid”. The 
complement to the absurd proposition is the trivial proposition I := (\geca which 
is always valid. 

The requirements on the family of propositions we have formulated until now 
are of a general nature and contain nothing peculiar to quantum theory. 


13.1.1 Example (proposition lattices in classical mechanics): The phase space IR?" 
is associated with an unconstrainted system with n degrees of freedom; its points 
are described generalized coordinates qi,...,g@n and the canonically conjugated mo- 
menta p),...,Pn- A typical proposition about such a system reads “the value of a 
Borel function F : IR?" — R lies in a Borel set A” (for instance, the energy of 
the harmonic oscillator, H(q,p) = q?+ p?, assumes a value E or is contained in 
an interval (E£,, &,)). The admissible values of the coordinates and momenta then 
have to satisfy the relation (q1,..., Pn) € F‘-")(A) ; this allows us to ascribe to any 
proposition a € Ly a Borel set M C IR in such a way that 


M(anb) = M(a)N M(b), M(a’) = R*"\ M(a). 


13.1. LATTICES OF PROPOSITIONS 427 


The propositions to which the same subset of the phase space corresponds are phys- 
ically equivalent, and therefore they can be regarded as identical and the map M(-) 
can be regarded as a bijection. 

The validity of the postulates (pl) for £, then follows from properties of the set 
operations, with the exception of (pl2): the family B?”" is closed with respect to at 
most countable intersections, so £,, is an countably complete orthocomplemented 
lattice. On the other hand, CL, has two additional properties. First of all, 


(pl,) any a,b,c € La satisfy the distributive law, i.e., 
aU(bNc) = (aUb)N(aUc), aNn(bUc) = (aNb)U(aNnc); 
a lattice with this property is called Boolean. 


Furthermore, a point in a lattice is a minimal proposition different from 9 ; if b is 
such a proposition, then the inclusions a C c Cc (aUb) imply c=a or c=aUb. 
We can easily check that 


(pla) for any proposition a € La different from @ one can find a point bC a;a 
lattice with this property is called atomic. 


The proposition lattices of quantum mechanical systems have a different struc- 
ture; the main difference is that they are not Boolean. 


13.1.2 Example: Let a; be the proposition “the j-th component of spin has the 
value 3 ”. The proposition a; Ma, is absurd if 7 #k, and similarly aa, = 9, so 
a; Ua, =I. It implies, e.g., (a; Ua2)M (a; Ua3) = J while a; U(a2Ma3) =a, ¥ I. 


In order to be able to weaken the requirement (pl,), we introduce a few notions. 
A sublattice is a subset in L, which is itself a (complete, orthocomplemented) lattice 
with respect to the same operations. For any set M C C there is a minimal lattice 
which contains it (cf. Problem 3); we call it the lattice generated by M . Propositions 
a,b are said to be compatible if the lattice generated by the set {a,b} is Boolean; 
we use the symbol a «+ b. The family of propositions in £ that are compatible 
with all a € L is called the center of CL ; if the center contains only the propositions 
0, I we say that the lattice is irreducible. 

The requirement which replaces distributivity for quantum systems can now be 
formulated as follows: 


(pl.,) the relation a C b between propositions implies their compatibility, a o b. 


It is usually called weak modularity; we shall check that it is satisfied in standard 
formalism of quantum theory. 


13.1.3 Example: Consider a coherent quantum system with the state space H ; 
the family of yes—-no experiments is in this case identified with the set B(H)” of 
all projections on H. The partial ordering is given by the inequalities between 
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the projections. It has the needed interpretation: a C b is equivalent to E(a) < 
E(b). To any two propositions a,b we define E(a™b) as the projection onto 
the subspace E(a)H M E(b)H ; in a similar way we define the infimum for any 
family of propositions. The orthogonal complement is described by the projection 
E(a') := I— E(a), and the absurd and trivial propositions are represented by the 
zero and unit operator, respectively. It is easy to see that the conditions (pl1)- 
(pl3) are fulfilled, i.e., that B(#)" with the mentioned operations is a complete 
orthocomplemented lattice; in addition, it is atomic, the role of points being played 
by one-dimensional projections (Problem 4). 

We shall prove that B(H)® is also weakly modular. This requires that we 
first check that the notion of compatibility introduced above is consistent with the 
definition of Sec.7.5. If a <— 6, then the propositions a,b as well as a’,b’ belong to a 
Boolean sublattice in B(H)® , i.e., we have (aMb)U(and’) = a and (anb)U(a'Nb) = 
b. The inclusion aNb! c (a/Mb)’' follows from aNb' c b' C a’ UD! and Problem 1b 
for S = {a’,b’} ; in a similar way we get a’Nb C (aMb)’ and anv’ C (a’Nb)’. Using 
Problem 4a, we then find that the projections E(aMb), E(a’Nb) and E(anbv’) are 
mutually orthogonal and satisfy the relations 


E(a) = E(anb)+E(anl), E(b) = E(anb)+ E(a' nb); 


it yields the commutativity of F(a), £(b). On the other hand, suppose that F := 
E(a) and F := E(b) commute. We can check directly that 


S= (0,1, 8)% 16, =F OF i ppp hP) P2eP Teer, 
1-F4+ EF, B>F-8F, |+ B=F4+0F, 6+F-0DP, d- BPP } 


this the lattice generated by {F, F'} since it is closed with respect to the multipli- 
cation, which represents the operation M, and P&S implies J—-P eS. The set 
S is obviously commutative and PUQ = 1 — (J— P)(I-Q) = P+Q-PQ holds 
for any P,Q €S; it follows that the projections E(a) and E(b) are compatible in 
the lattice sense. Hence the two definitions are equivalent and the weak modularity 
of B(H)” is a consequence of Proposition 3.2.9b. 


13.1.4 Remarks: (a) The proved equivalence shows at the same time that the 
irreducibility of the lattice B(H)” is equivalent to its irreducibility as an 
operator set. This correspondence can be extended to more general situations. 
Consider, for instance, a quantum system whose algebra of observables is of 
the form (7.18). The set A” of all projections in A forms an (atomic, weakly 
modular) lattice, which is reducible unless the index set J consists of a single 


point; its center is generated by the projections onto the coherent subspaces 
(Problem 5). 


(b) Propositions a,b are said to be disjoint if a C b! ; the requirement of weak 
modularity means that disjoint propositions are mutually compatible. 


These considerations show that in the quantum case the postulate (pl) can be 
appended as follows: 
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(pl,) the set of propositions for a quantum system is an atomic, weakly modular 
(complete, orthocomplemented) lattice. 


In order not to repeat the adjectives all the time, we shall refer to a family L 
which satisfies the condition (pl,) as a proposition system. Two such systems are 
isomorphic if there is a bijection between them which preserves the ordering and 
the orthogonal complement, and maps the infimum of any subset M Cc C to the 
infimum of its image. 

The basic question that the discussed axiomatic approach should answer is 
whether there are proposition systems, which are not isomorphic to the standard 
system of Example 3, or whether on the contrary the quantum mechanical formalism 
treated in the previous chapters is the only possible one. We return in this way to 
the question considered on a heuristic level in Section 7.1. 

This problem is highly nontrivial and we restrict ourselves to a brief description 
of the result; references to the proofs are given in the notes. First we have to mention 
how a proposition system can be reduced. If the center of the lattice £ contains 
a finite number of propositions different from @, J, it is easy to decompose L to 
irreducible components (Problem 6). The same procedure can be applied in the 
general case; it is just more difficult to check the existence and uniqueness of the 
decomposition. With this fact in mind, we shall suppose in the following that the 
proposition system CL is irreducible. 

A few more notions are needed. The dimension of an atomic lattice is the 
minimal number of compatible points 6,,62,... such that U;b; = J. Let F bea 
field; an involutive automorphism is a map *: F — F' such that 


(a+ 8)* = a*+ B*, (a6)* = Bra’, Ce Qa 


holds for any a, @ € F ; examples are the identical mapping in RR or C, the complex 
conjugation in € , etc. Given a vector space V over F’, we define a symmetric 
sesquilinear form f : V x V — F in the same way as in Sec.1.1 replacing only the 
complex conjugation by the corresponding involutive automorphism. A particular 
case of the sesquilinear form is an inner product, which is again defined by the 
requirement that f(z,r)=0 iff «=0. In addition to real and complex Hilbert 
spaces, one can in this way define, for instance, Hilbert spaces over the field Q of 
quaternions (see the notes). The mentioned result now reads as follows. 


13.1.5 Theorem (Piron): (a) Let £ be an irreducible proposition system of a 
dimension > 4. Then there is a vector space V over F’, an involutive 
automorphism of the field F', and an inner product (-,-): V x V — F’ such 
that L is isomorphic to the lattice consisting of those subspaces in V , which 
are orthogonal complements of some subset M C V with respect to (-,-). 


(b) In addition, if the field F’' is required to contain the reals, then the proposition 
system L is isomorphic to the lattice of closed subspaces in a Hilbert space 
over some of the fields IR, €, or Q. 
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Loosely speaking, the existence of partial ordering and the operation  toge- 
ther with the assumptions of atomicity and weak modularity determine the vector 
structure of the state space, leaving the choice of the field open. The fact that CL is 
equipped with orthocomplementation induces the existence of an inner product. 

If we adopt the hypothesis of part (b), for which we have no empirical support, 
the theorem leaves three possibilities open; in addition to the standard quantum 
theory the postulate (pl,) also allows theories in which the state space is a Hilbert 
space over the field IR or Q. Both the alternative schemes, real and quaternionic 
quantum theories, have indeed been constructed. In some cases, the equivalence of 
their predictions with those of “complex” quantum theory has been demonstrated 
(see the notes), but in general this problem remains open. 


13.2 States on proposition systems 


We know that states of a quantum system are determined experimentally by mea- 
suring a family of transition probabilities by repeating a certain collection of yes—no 
experiments on a large number of copies of the system. It is therefore natural 
in the axiomatic approach under consideration to identify a state with a function 
p: £— [0,1] whose values are the probabilities that a given proposition is true. 
We require it to satisfy the following conditions: 


(sl) p(@)=0 and p(I)=1, 
(s2) p(U;a;) = %; p(a;) for any sequence {a;} of mutually disjoint propositions, 


(s3) if {a, : a € J} is a subset in C whose elements are valid with certainty, 
p(aq) = 1 for all ae J, then p(Nacy@a) = 1, 


(s4) for any pair a,b€ £L, a# bd, there is a state p such that p(a) 4 p(d). 


Conditions (s2) and (s3) can be justified for a finite number of propositions; for the 
sake of simplicity we make the usual idealization and extend their validity to infinite 
subsystems as well. Requirement (s2) means, in particular, that the inclusion a C b 
implies p(a) < p(b). The purpose of condition (s4) is to eliminate “superfluous” 
elements of system C ; if two propositions can be separated by no state, i.e., p(a) = 
p(b) for all p, it is natural to regard them as identical. It is easy to see that states in 
the standard quantum mechanical formalism fulfil these requirements (Problem 7). 
Suppose that p;,p2 are states on £ and a, are non-negative numbers such 
that a+ 6 = 1; then ap, + Gp. is again a state. Hence the set of all states on 
L is convex; its extremal points will be called pure states as before while the other 
states are mized. For any state p and a proposition a we define its dispersion by 


0p(2) := p(a) — p(a)? ; 


this is consistent with the definition mentioned in the notes to Section 7.3 due to 
the fact that propositions are represented in the standard formalism by idempotent 
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operators, E(a)?= E(a). Furthermore, we define the total dispersion of a state p, 
Op := sup o,(a) ; 
acl 


its value lies in the interval [0,4]. If o, = 0, the state p is called dispersionless. 
Any such state is pure (Problem 8) while the opposite implication is not valid as we 
shall see in a moment. 

The existence of dispersionless states represents an important distinction be- 
tween classical and quantum systems. In classical mechanics such states do exist. 
Consider, e.g., the lattice C4 of Example 13.1.1 and the state p determined by a 
point (q\”,...,p®) of the phase space. We know that Ly may be identified with 
the o-algebra of all Borel sets M Cc JR"; the function M +> p(M) then assumes 
only the values 1 or 0 depending on whether the point (q)”,...,p)) is contained 
in M or not. It follows that p is dispersionless, and therefore pure. Such states 
are typical of classical mechanics; we usually take no other states into account. 

The situation in quantum mechanics is different; owing to the existence of non- 
compatible observables we can find pure states with a nonzero dispersion. 


13.2.1 Example: Consider again the propositions corresponding to the electron 
spin of Example 13.1.2. The state p in which the spin is oriented along the third 
axis, i.e., p(a3) = 1 and p(a,) = 0, is obviously pure; however, p(a,) = 3; so its 
dispersion is op = op(a1) = 3. 


Experience tells us that dispersionless states do not exist in quantum systems. 
This claim can easily be illustrated in the standard formalism. However, we can also 
prove it using only the properties of system £. For simplicity we restrict ourselves 
to the situation without superselection rules. 


13.2.2 Proposition: Suppose that the system is coherent, t.e., the corresponding 
proposition system CL is irreducible; then there are no dispersionless states on it 
unless L is trivial. 


Proof: Let p be a dispersionless state; then p(a) = 1 or p(a) = 0 holds for 
all a € £L. We denote by L, the set of those propositions for which the first 
named possibility takes place, and set e := inf L, ; it follows from condition (s3) 
that. p(e) = 1. If a proposition 6 C e is different from e, then p(b) = 0. The 
propositions }, b’ are obviously disjoint, so (s2) gives p(b) + p(t’) = p(bUY) = 1, 
i.e, b' € L,. We obtain b= bNe C bNV = 9, which means that the inclusion 
b Ce is satisfied only if b=e or b=9; hence e is a point. 

We shall check that e is contained in the center of the lattice L. If ae Ly, 
then e C a and the weak modularity of £ yields a + e. Conversely, p(a) = 0 
implies a’ € £L, as we have demonstrated above, and therefore e <— a’ ; using 
Problem 3e we again get e <a, 7.e., together e ~ a for all a € L. However, the 
center of L is trivial by assumption, so e = / . Since we know that e is a point, it 
follows that the lattice is trivial, C= {0,J}. WU 


432 CHAPTER 13. AXIOMATIZATION OF QUANTUM THEORY 


This fact has been regarded by some physicists as a serious defect showing that 
quantum description of reality is incomplete. A possible explanation was that mi- 
croscopic systems could exist in dispersionless states but for some reason we were 
able to prepare only their mixtures. The quantities that should be appended to com- 
plete sets of observables to enable preparation of dispersionless states has therefore 
been named hidden parameters. If such a “complete” theory could be constructed, 
of course, it must reproduce all experimentally verifiable predictions of standard 
quantum theory. 

It appears, however, that this is impossible. To be able to formulate this result 
more exactly, we need the appropriate definition. We say that a quantum system 
with the proposition lattice £ admits a description by means of hidden parameters 
if there is a measure space (X,A,o) such that o0(X) = 1, and to any state p on 
L, there is amap p: Lx X — [0,1] with the following properties: 


(i) p(a,-) is measurable on (X,A) for any aE CL, 
(ii) p(-,&) is a dispersionless state on £ for g-a.a. EEX, 


(iii) the state p expresses as a mixture of p(-,&)-, t.e., 


p(a) = [| r(a,€) do(é) 
holds for all ae L. 


In particular, if the measure @ has a discrete support {&,...,&:} C X, then 
the last relation may be written as p(a) = \°, wepe(a), where we := 0(&) and 
pe(-) = p(-,&), k = 1,2,..., are dispersionless states; we have, of course, >, we = 
o(X) = 1. We shall need the following auxiliary results. 


13.2.3 Proposition: (a) Let C admit description by means of hidden parameters; 
then p(a)+p(b) = p(aMb) + p(aUb) holds for each state p and any a,be L. 


(b) Propositions a,b in a weakly modular C are compatible iff (aNb’)UbDa. 


Proof: In order to prove (a), it is sufficient according to condition (iii) and linearity 
of the integral to check the identity for a dispersionless p. In that case, p(a) 
and p(b) may assume only the values 0, 1. Let us review the four possibilities. If 
p(a) = p(b) = 0; then the inequality p(anb) < p(a) implies p(amb) = 0. Moreover, 
we have p(a’) = p(b’) = 1, 7.e., p(a’ Nb’) = 1 by the postulate (s3), which yields 


p(aUb) = 1—p((aUb)’) = 1—p(a'nb’) = 0; 


hence the identity is valid. Consider next the case with p(a) = 1 and p(b) = 0. 
Using the inequalities p(a Ub) > p(a) = 1 and p(amb) < p(b) = 0, we see that 
p(aUb) = 1 and p(anb) =0, so the relation again holds true; the same conclusion 
can be made in the case p(a) = 0, p(b) = 1, which differs from the previous one by 
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the interchange of the propositions. Finally, if p(a) = p(b) = 1 we use the identities— 
p(aUb) = 1— p(a’ NW’) and p(anb) = 1 — p(a’ UW’), which give 


p(a) + p(b) — p(anb) — p(aUb) = p(a’ Ud’) + p(a’ NB) — p(a’) — pb’), 


and since we have already proven the assertion for the propositions a’,b’ fulfilling 
p(a') = p(b’) = 0, part (a) is valid. A reference to the proof of (b) is given in the 
notes. &f 


With these preliminaries, we are now able to prove the mentioned result. 


13.2.4 Theorem (Jauch-Piron): If a proposition system £ admits description by 
means of hidden parameters, then any two propositions a,b € £ are compatible. 


Proof: Using Proposition 3a repeatedly, we get 


p((anb')Ub) = p(and’) + p(b) = p(a) + plo’) — p(aUb’) + p(d) 
= p(a)+1+p(aUd) = p(a) + p(a'Nb) = p(aU (a'Nb)). 


By assumption, this relation is valid for all p, so (aN b’) Ub =aU(a'Nb) holds 
due to postulate (s4). However, the proposition on the right side is implied by a, 
and therefore the assertion follows from Proposition 3b. § 


After this digression, let us return to the main question, to what extent the 
formulated postulates determine the structure of quantum theory. In the preceding 
section we dealt with the state space; now we want to know whether the postulates 
(sl1)~—(s4) lead to the standard description of states. For simplicity we shall suppose 
that the assumptions of Theorem 13.1.5b are satisfied, 7.e., that the state space is a 
Hilbert space H over some of the fields R, (, or Q. 

Let W be a statistical operator W on H ; its definition and properties for 
F = R,@ are analogous to the complex case. The map pw : B(H)” — [0,1] 
defined by 

pw(E) := Tr (EW) 


satisfies the mentioned requirements (Problem 9); in other words, py is a state 


on £ = B(H)®. Conversely, we may ask whether for any p, which satisfies the 
conditions (sl) and (s2), i.e., 


p(0) = 1-—p(/)=0 and (> Bs) =). w(Er) ; (13.1) 


where {F,} is an arbitrary sequence of mutually orthogonal projections, we can 
find a statistical operator W such that p = py. It is not difficult to see, for the 
question stated like that, that the answer is negative. 


13.2.5 Example: Any one-dimensional projection on H := €? may be charac- 
terized by a pair of complex numbers, FE = E(a,) with |a|?+|8|? = 1, so that 
the corresponding subspace is spanned by the unit vector (a, 3) € C?. It is obvi- 
ous that. E(Aa, AG) = E(a,8) if |A| = 1. Any sequence of mutually orthogonal 
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projections consists in this case either of a single operator E(a, 8) or of a pair 
E(q, 8), E(B,—&). Let S be the unit sphere in C? and suppose that a function 
f: S—R satisfies the conditions 


f(a, XB) = f(a,8) for jAJ=1, f(a,8) = 1- f(6,—-4). (13.2) 
Then we may define the function ps : B(€?)” — [0,1] by 
py(0) = 1—ps(1) := 0, ps(E(a, 6)) = f(a, 8). 


It obeys the conditions (13.1), and we readily check that py = pw may hold for 
some W € B(C€”) only if the function f is continuous. However, the conditions 
(13.2) are satisfied by other functions, for instance, by 


ale .Ometali< 1 
J soot | 1 ace 
Oo ... jal=1 


and therefore the lattice B(C€?)” supports states without a trace representation. 


Fortunately, it appears to be substantial that we have considered a two—dimen- 
sional space, which contains “too few” projections. In spaces of a higher dimension 
we are able to describe states by statistical operators. This result is highly nontrivial 
and we present it without proof (references are given in the notes). 


13.2.6 Theorem (Gleason): Let H be a Hilbert space over one of the fields R, 
C, or Q and dimH > 3. Then for any function p : B(H)® — [0,1] which 
satisfies the conditions (13.1) there is just one statistical operator W such that 
p(E) = Tr(EW) holds for any E € B(H)®. 


Hence the postulates (s1) and (s2) together with the additional restriction to the 
state space dimension lead to the standard description of states, provided the system 
is coherent and its algebra of observables is A = B(H). Moreover, these assumptions 
may be weakened: the Gleason theorem extends to systems with superselection rules 
and more general algebras of observables — see the notes. 


13.3. Axioms for quantum field theory 


The considerations of the two preceding sections apply to quantum mechanics as well 
as to quantized fields. However, in the latter case the axiomatic approach usually 
means something else; the reason is that the situation here is entirely different. 
In quantum mechanics it is not difficult to associate a well-defined mathematical 
structure with the systems under consideration, i.e., to define the state Hilbert 
space and operators on it representing observables. Although this task is sometimes 
performed for particular physical problems in a mathematically superficial way, there 
is no doubt that with some effort these flaws can be corrected. 
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Quantum field theory, which was born almost simultaneously with quantum 
mechanics, represents a considerably more complicated object. Its first three decades 
brought a host of results and some remarkably exact predictions, in particular, in 
quantum electrodynamics, but the methods used abounded with mathematically 
dubious arguments. Beginning from the early fifties, this motivated the formulation 
of several axiomatic systems, which a correct quantum field theory was expected 
to satisfy; the question was whether the existing formal theories could be put into 
accord with these postulates. The intricacy of the problem is clear from the fact 
that, in spite of great progress, forty years later there is still no exhaustive answer. 
Here we limit ourselves to a description of two systems of axioms; some more will 
be mentioned in the notes. 

' The oldest and best known are the Gérding-Wightman axioms. We shall for- 
mulate them for the simplest case of a scalar field. 


(GW1) Relativistic transformations: States are represented by rays in a complex 
Hilbert space # , and there exists a unitary strongly continuous representation 
U of the proper orthochronous Poincaré group Pl on H, which describes 
transformations of states when we pass from one inertial frame to another. 


The subgroup {(J,a) : a € R*} Cc P representing space-time translations is 
Abelian, so by Problem 5.48 there are four commuting self-adjoint operators P,, u = 
Onl; QpSqsueh thateU U,a)= ei(Pooo-P-4) with P.@ := Dae P;a; holds for a € RR‘. 
The operator H := Po is the Hamiltonian of the system while Pj, 7 = 1,2,3, has 
the meaning of momentum components. Moreover, we know from Chapter 5 that a 
unique projection E(-) on HR’ is associated with the operators P, and 


(¢,U(,a)b) = f eloom-*¥a(4, E(pyw) 


holds for all ¢,y € H. Recall also that the closed future light cone is the set 
Vi := {p= (p0,7): m 20, Pp =p -p > 0}. 


(GW2) Spectral condition: The support of the measure E(-) associated with the 
operators P, is contained in V, ; in other words, the set IR*\V4 is E-zero. 


Note that the postulate (GW2) requires the Hamiltonian Po to be positive. The 
same has to be true for M? := P? — yj_, P?, so it makes sense to define M := 
(M?)}/2 which is called the mass operator. 


(GW3) The existence and uniqueness of the vacuum: The space H contains just 
one vector wp such that U(A,a)%o = vo holds for all (A,a) € Pi ; we call it 
the vacuum. 


This implies, in particular, that the E-measure of the point (0,0) € V4 is nonzero 
and dim Ran E(0,0) = 1. Now we can pass to the requirements imposed on the 
field itself. We mentioned in Remark 12.3.7 that it is reasonable to regard it as an 
operator-valued distribution. In addition, field operators are generally unbounded, 
and therefore they are not defined on the whole H. 
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(GW4) The invariant domain: There is a dense subspace D C # containing the 
vector ~, and the map y: S(JR*) — L(H) with the following properties: 


(i) D belongs for any f € S(IR*) to the domains of the operators y(f), e(f)* 
and y(f)*}D=9(f)}D, 

(ii) y(f)D C D holds for all f € S(R*), 

(iii) the map f ++ y(f)p is linear for each ~ € D. 


Condition (i) implies that the operators y(f) associated with real-valued test func- 
tions f € S(UR*) are symmetric, and (ii) ensures that polynomials in the operators 
y(f), v(f)* are well-defined. This fact in combination with (GW3) means that 
the expressions (wo, o(f1)---y(fn)Wo), called vacuum means, make sense for any 
fi,---,)fn € SUR‘). The distribution character of the field is expressed as follows. 


(GW5) Field regularity: The map f +> (w1,¢(f)yY2) is a tempered distribution 
for any w1,~2E D. 


Next we have to postulate the transformation properties of field operators. 


(GW6) Relativistic covariance of the field: The relation U(A,a)D C D holds for 
all (A,a) € P! , and moreover 


U(A, a) 9(f) U(A, a)'y ae 0(faa)v 
for all f € SUR*) and we D, where faa(x) := f(A7'(r4—@)). 


If we follow the standard convention by which distributions may be written as func- 
tions, t.e., we use formal “ fields at a point x”, the last condition acquires the 
illustrative form U(A,a)y(r)U(A,a)~'W = y(Az 4+ a)y. Recall further that points 
x,y are space-like separated if (2p — yo)*— (€ — y)* < 0; sets Mi, Mo € IR‘ are 
space-like separated if this is true for any x € M, and y € Mo. 


(GW7) Microcausality (or local commutativity): If the supports of f,g € S(UR*) 
are space-like separated, then [y(f), (g)]~ = 0 holds for any we D. 


Finally we have to add a requirement which will ensure that the state space is not 
too large. 


(GW8) Vacuum cyclicity: {p(fi)---9(fn)¥o: fi,---,fn € SUR‘), n € Zs} is 
total in the state space H. 


We have to check, of course, that the system of axioms is free of contradictions. 
It is sufficient to find an example of the quadruplet {H, U, y, vo} which would 
satisfy all the stated requirements. The simplest system that comes to mind is 
naturally the free scalar field (12.13). In this case we already know the state space 
and the vacuum vector, as well as the appropriate representation of the Poincaré 
group which is obtained by second quantization of the one-particle representation of 
Proposition 10.3.3, (Um(A,a)w)(p) := e'*(A7'p), for any ~ € Hm. In a similar 
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way we can check the other postulates, 7.e., to prove the following assertion (see the 
notes). 


13.3.1 Theorem: The quadruplet { F,(L?(Hm, dwm)); AEC), Gx(:); Q% } sat- 
isfies the postulates (GW1)—(GW8). In addition, 


&,,((4 +m’) f) = 0 (13.3) 
holds for any f € S(IR*) , where of := 2f —Af. 


The result is not surprising, of course, because the axioms have been formulated 
with the free field on mind. Much less trivial is the question whether one can 
construct examples of fields with a nontrivial interaction which will obey the GW- 
axioms. This question stimulated a search for rigorous quantum-—field model which 
has produced some remarkable results, though a satisfactory positive answer is still 
missing; we present a few comments on this problem in the notes. 

The second set of postulates that we are going to mention here are the Haag- 
Kastler azioms; in contrast to the preceding case they concern rather the algebraic 
structure of quantum field theory. We can formulate them, for instance, as follows: 


(HK1) Algebras of local observables: A C*-algebra A(M) with the unit element 
is associated with any bounded open region Mc R*. 


(HK2) Isotony: M, C M2 implies A(M,) Cc A(M2). 


The existence of a partial ordering between the algebras of local observables is 
important because it allows us (by a standard procedure called inductive limit) to 
associate the C*-algebra A(M) corresponding to M := Uj2, Mj with a monotonic 
sequence {M;} of sets, Mj4; > M;. In particular, one can in this way construct 
the algebra A := A(JR*) , whose Hermitean elements are sometimes called quasilocal 
observables. 


(HK3) Microcausality: If the regions M,, Mz C IR’ are space-like separated, then 
A(M,) c A(M2)’. 


(HK4) Relativistic covariance: For any (A,a) € P| there is a *-automorphism 
p,q Of the algebra A such that 


ana(A(M)) = A(A-*(M — a)) 


holds for any bounded open region M Cc IR*. The map (A,a) +> Qaq isa 
representation of P]. 


Although the two described axiomatic systems lean on the same physical principles, 
they are not directly related. The Haag-Kastler axioms are more abstract and 
better suited for analysis of general properties of fields such as the structure of 
superselection rules, etc. We expect, of course, that physically reasonable quantum-— 
field models will obey both sets of postulates, as well as other sets which we mention 
in the notes. 
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Notes to Chapter 13 


Section 13.1 As we saw in Sec.7.6, a stumbling point in the algebraic approach to axiom- 
atization of quantum mechanics is the definition of the sum for noncompatible observables. 
This led G. Birkhoff and J. von Neumann to an attempt at formulating a system of axioms 
which would avoid summation of observables — cf. [BvN 1]. Their idea has been further 
developed by G. Mackey, G. Ludwig, J.M. Jauch, C. Piron and other authors; a fairly 
complete overview can be found in [Ja], [Var 1,2], [Lud 1-3], [Pir]; see also the review 
paper [Gud 1]. 

The requirement (pl2) for any subset in C (which is usually referred to as complete- 
ness) again represents an idealization, because we are in fact able to compare only finite 
proposition systems. It has sometimes to be weakened for mathematical reasons — cf. 
Example 1. A more complete justification of the relations between propositions based on 
analysis of the structure of experiments can be found, e.g., in [Herb 1]. A lattice C of 
propositions about a quantum system is often called a quantum logic since, like formal 
logic, it contains the relation of implication and the operations of negation, logical prod- 
uct “a and 6”, and logical sum “a or b”; the name appeared for the first time in the 
mentioned paper [BvN 1]. This analogy should not obscure the fact that £ represents a 
collection of empirical information about a particular physical system; at the same time, 
we have to keep in mind that formal logic has the Boolean property which, in general, is 
not true for £ — see Example 2. A more detailed discussion of this problem is given in 
[ Ja], Sec.5-3. The non-Boolean nature of quantum proposition systems is manifested 
in various ways; for instance, the uncertainty relations in this formalism are treated in 
[Pul 1]. The structure of proposition lattices for classical systems is analyzed in [| Ja], 
Sec.5—-4; [ Var 1], Chap.I. 

The name “weak modularity” comes from the fact that this condition weakens the 
requirement of modularity of the lattice £, which means aU (bMc) = (aUb) Ne for 
a Coc. This property was proposed by Birkhoff and von Neumann as a replacement 
of the distributive law. However, it appears that the lattice B(H)” is modular only if 
dimH < oo, and therefore modularity cannnot be required for the proposition lattice of 
@ general quantum system — cf. [Ja], Sec.5-7. 

The reduction of a proposition system to irreducible components is discussed, for 
instance, in [ Ja], Sec.8-2. Some ideas of the proof of Theorem 5 already appeared in 
the paper [BvN 1], which treated proposition lattices of a finite dimension; the general 
result. was obtained by C.Piron — see [Pir 1] or | Var 1], Sec.VIII.5. The hypothesis 
of Theorem 5b lacks a convincing justification. Nevertheless, one can at least prove that 
the field F cannot be finite unless it is noncommutative — cf. [EZ 1], [IS 1]. It is also 
worth recalling that the field of quaternions or hypercomplex numbers consists of all real 
linear combinations a = a; + a2i +437 + ask, where the quantities i,j,k satisfy the same 
algebraic relations as the multiples of Pauli matrices —io,, | = 1,2,3. 

Many results concerning quantum theory in real Hilbert spaces can be found in the 
papers [Stu 1 4]; interest in it vanished when it became clear that predictions for simple 
systems are the same as in the “complex” case. The “quaternionic” quantum theory 
has attracted more attention. It was first formulated in [FJSS 1,2]. Although for a 
single relativistic elementary particle it reproduces the “complex” results — see [Em 1] 
— in general this theory is expected to have a richer structure. More recent results in 
“quaternionic” quantum mechanics and field theory can be found, e.g., in {HB 1], [ Ad] 1]. 
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Section 13.2 The definition of state requires, to interpret the function p: £ — [0,1], 
a modified way of understanding the probability . As we have mentioned in Sec.7.1, the 
conventional way to introduce this notion leans on a non-negative o—additive measure 
whose domain is a certain o—algebra, i.e., a Boolean lattice; however, proposition lattices 
corresponding to quantum systems are non—Boolean. 

The first result on the nonexistence of hidden parameters was formulated by J. von 
Neumann within the standard quantum-mechanical formalism — cf. [| vN ], Sec.4.2. 
Theorem 4 was proved for the first time in [JP 1], see also [ Ja], Sec.7—3; the proof of 
Proposition 3b which is used here can be found in [Pir 1]. It is worth mentioning that 
Theorem 4 holds irrespective of the assumption of atomicity of the lattice £, and that 
the proof employs postulate (s3) for finite proposition subsystems only. There are more 
sophisticated attempts to introduce hidden parameters into quantum theory but we are 
not going to discuss this question here. 

The proof of Gleason’s theorem is a nice problem, which can be managed with the 
tools elaborated in this book; unfortunately it is too long and complicated to be included. 
The original paper [Gle 1] contains the demonstration for F = IR, C; the extension to the 
quaternionic case is given in | Var 1], Sec. VII.2. These proofs also used the assumption 
that H is separable, but this could be eliminated — see [EH 1]. As we have mentioned, a 
state represents a probability “measure” on the proposition system L ; Theorem 6 extends 
to a class of signed “measures” on L — ef. [Dri 1]. 

The Gleason theorem readily generalizes to the situation with superselection rules, 
where £:= A” with the algebra of observables (7.18) in which the dimension of each 
coherent subspace is at least three — see [ Jor], Thm.28.2. In [Mac 1] a more complicated 
question has been posed, namely whether the analogous result is also valid for the lattices 
of projections in W*-—algebras which are not of type J. The answer is positive provided 
the decomposition of A does not contain type Iz factors, and the same is also true for a 
certain class of Jordan algebras — see, e.g., [BW 1], [Chri 1], [ Yea 1]. 


Section 18.3 The Garding-Wightman axioms were formulated at the beginning of the 
fifties and published a decade later in | WG 1]. We group them following [RS 2], Sec.IX.8; 
usually they are condensed into a lesser number of postulates — see [SW], Sec.3-1; [Si 2], 
Sec.II.1; [GJ], Sec.6.1. Different formulations contain slight modifications; for instance, 
postulate (GW5) is sometimes replaced by the weaker assumption that the test-function 
space is C§°(IR*), the state space H is in addition supposed to be separable, or one 
assumes from the beginning the existence of a representation of the full Poincaré group 
including the reflections, etc. 

It follows from the physical interpretation of the generators of the representation 
U that U(J,a) with a := (—t,0) is the evolution operator of the field y ; postulate 
(GW6) then tells us that the field operators y(f) witha real feS (IR*) are observables 
in the Heisenberg picture — provided they are not only symmetric but self—adjoint as 
well. Postulate (GW3) can be weakened; it is sufficient to require vacuum invariance with 
respect to space-time translations — cf. [RS 2], Sec.IX.8. As for distribution character of 
the field, we are forced to work with “smeared” field operators: one can show that a scalar 
field obeying the GW-axioms cannot be expressed in the form y(f) = fms y(z) f (x) da 
for some (nondistributional) function y: IR* — L(H) — see [RS 2], Problem IX.53. 

We have presented the GW-axioms for the simplest case of a scalar field, which is 
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called Hermitean in connection with requirement (i) in (GW4). We may consider more 
general scalar fields such that the operators y(f) corresponding to real-valued f € 5 UUR*) 
are not symmetric. More important, however, are modifications to vector and spinor fields 
describing particles with nonzero spins: 


(a) in this case the fields are multicomponent operator-valued functions, i.e., y(f) = 
(yi(f),---,Vease1(f)) with yp; : SUR*) - L(H), and the transformation property 
of (GWE6) is replaced by U(A, a) p(f) U(A,a)~"b = (S(A~")y)(fa,a)¥ for any p € 
D, where S(A~!) corresponds to the matrix operator of Proposition 10.3.3. Fields 
with a half—integer spin change sign when rotated by 27. 

(b) the microcausality postulate differs with respect to the type of statistics: if the 
supports of f,g are space—like separated, then we have [y;(f), y(9)|<¥ = 0 for all 
the field components and any w € D, where the upper and lower sign correspond 
to the boson and fermion cases, respectively. 


In distinction to quantum mechanics, one can demonstrate here the connection between 
spin and statistics; namely, one can prove that a quantum field, which satisfies the GW- 
axioms with the listed modifications, is governed by Bose-Einstein or Fermi—Dirac statis- 
tics iff the spin of the particles involved is integer or half-integer, respectively. This result 
was obtained by M. Fierz and W. Pauli for free fields; the general proof was given by 
G. Liiders, B. Zumino and N. Burgoyne — see, e.g., [SW], Sec.4-4; [ BLT], Sec.5.3. 

Proof of Theorem 1 can be found, e.g., in | SW J, Sec.3-2; [ RS 2], See.X.7. Re 
lation (13.3) expresses the fact that the formal field operators satisfy the Klein-Gordon 
equation. Another example of systems which obey the GW-axioms is provided by the 
so-called generalized free fields — see, e.g., [| BLT ], Sec.3.4. They are superpositions 
of free fields of different masses, i.e., the one-particle space equals the direct integral 

<4 L?(Hm,dwm)do(m) where o is a positive measure; the generalized free field is then 
defined as in Sec.12.3. If o is supported by a single point we get a free field. 

Axiomatic quantum field theory splits in two main directions. One is concerned with 
general properties of fields, relations between different postulate systems, etc. On the 
other hand, constructive field theory aims at finding explicitly formulated examples of 
quantum fields satisfying some system of axioms. Since the problem is very complicated; 
the first attempts were concerned with models in a space-time of dimension d < 4. The 
first nontrivial model for d = 2 was found at the beginning of the seventies; a few years 
later examples were also obtained in the case d = 3 — see [GJ 1], [Si 2], [GJ], and 
more recently, e.g., [BFS 1], [Ost 1]. The drawback of the methods used was that they 
worked only for superrenormalizable models, i.e., such that the number of singularities in 
the formal perturbation-theory expansion was finite. In the middle of the eighties new 
models appeared (for d = 2,3) which were not superrenormalizable — see, e.g., [FMRS 1], 
[GK 1] and the review | Ost 1]— this gave a new impetus for study of physically most 
interesting case d = 4. Let us remark that constructive methods are not the only way 
to establish the existence of interacting quantum fields; for instance, there is an abstract 
result derived in [Hof 1] due to which a quantum field must exist, which satisfies the 
GW-axioms and differs from a generalized free field. 

Using the vacuum means we can define for any positive integer n a distribution 
Wr: Walfi,---> fn) = (Wo, 0(fi)---~(fn)Wo), which is called the (n-point) Wightman 
function. Starting from the GW-axioms, one can derive what they imply for the sequence 
{Wn}nii: the behavior at relativistic transformations, the consequences of the spectral 
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condition and microcausality, etc. However, it is also possible to reverse the order and to 
postulate these properties of the functions W,. This collection of requirements is called 
the Wightman azioms; their equivalence with the GW-axioms follows from the reconstruc- 
tion theorem, due to which a quantum field satisfying the GW-axioms corresponds to a 
sequence {W,,} which obeys the W-axioms, and this field is determined uniquely up to a 
unitary equivalence — see, e.g., [SW], Sec.3-4; [Si 2], Sec.II.1. 

Another axiomatic method is based on the fact that Wightman functions have certain 
analytical properties which allow us to study their continuation to regions of complexified 
configuration space where time is purely imaginary. These analytic continuations, so— 
called Schwinger functions, are then invariant with respect to the Euclidean group of 
nonhomogeneous linear transformations of the space JR* rather than to Pa Properties 
of Schwinger functions can be postulated as Osterwalder-Schrader axioms; these were 
formulated for the first time in [OS 1,2]; see also [ Si 2], Chap.II; [GJ], Sec.6.1. In 
this case, too, a reconstruction theorem is valid, which enables us to find the Wightman 
functions corresponding to the Schwinger functions which satisfy the OS—axioms. 

An alternative formulation of the Euclidean approach to quantum field theory em- 
ploys the notion of a Euclidean field, i.e., an operator object whose vacuum means are 
the Schwinger functions. This idea was proposed by J. Schwinger, T. Nakano and K. Szy- 
manczik; later, the required properties of such fields were formulated as the so-called 
Nelson azioms — see [Nel 3] and [Si 2], Sec.IV.1. If these axioms are satisfied the same 
is true for the OS—axioms, while the opposite implication is not valid. 

To illustrate the advantages of this approach we use an analogy with quantum me- 
chanics. Let H be a Schrédinger operator on L?(JR*). The treatment of the propagator 
e-*4t can be simplied if we are able to reconstruct it from knowledge of the operators 
et for t > 0; this is possible, e.g., when operator H is below bounded so the function 
z+ ez is strongly analytic in the halfplane {z: Imz <0} (see the notes to Sec.1.7). 
The operators e~#* can be expressed by means of the Feynman-Kac formula (9.8), whose 
right side is a well-defined mathematical object which may be treated by methods of in- 
tegration theory. In a similar way the Feynman-—Kac formula is used to express Euclidean 
fields and their vacuum means — cf. the notes to Section 9.4. 

Haag-Kastler axioms were formulated in {HK 1]; the construction of the algebra 
A, which is usually called the quasilocal algebra, can be found in [ BR 1 J, def.2.6.3. 
There is a modification of this system of postulates in which one associates a W*—algebra 
A(M) with any bounded open M c R?*. In this case we speak about the Haag-Araki 
axioms; the postulate of relativistic covariance then requires existence of a unitary strongly 
continuous representation U of Pi such that U(A,a)A(M)U(A, a)~! = A(A7!(M —a)) 
holds for any (A,a) and M. In this approach quasilocal observables are operator-norm 
limits of local observables. An extensive overview of these axioms, their modifications, 
and consequences can be found in the monograph | Hor]; see also the reviews { Haa 1,3], 
[ Ara 2]. Relations between HK-axioms and GW/OS-axioms are discussed in | GJ ], 
Chap.19; however, some problems remain open — see, e.g., [HV 1]. 


Problems 


1. Let a be any element of a proposition lattice £ and S CL. Prove: 
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(a) acl and aUa' =I, 
(b) Uses a= (Noes a’)’. 


Let a = 4(1 +0;) be the eigenprojection of the j-th component of spin corre- 
sponding to the value 4. Show that litineascli, EL)” = 0 holds for 7 #k. 


Let £ be a complete orthocomplemented lattice; then 


(a) the intersection of any family of sublattices in £ is a sublattice in CL, 
(b) if £ is Boolean, so is any sublattice of it, 

(c) the propositions a,a’ are compatible for any a € CL, 

(d) the propositions 9, J are compatible with any a€ L, 


(e) propositions a,b € £ are compatible iff the same is true for a,b’. 
Let B(H)® be the lattice of Example 13.1.3. Prove that 


(a) E(aUb) is for any a,b the projection onto the subspace (E(a)HU ET 
In particular, if a C b’, then the projections E(a), E(b) are orthogonal and 
E(aUb) = E(a) + E(b), 


(b) the lattice B(H)” is atomic. 


Let A be the algebra (7.18); then A” is an atomic weakly modular lattice and 
its center consists of the projections P := )°o¢y)CaEa, where the Eq are the 
projections onto the subspaces Ha, and the coefficients cq assume the values 0 or 
il. 


Suppose that CL is a proposition system and its center contains an element c different 
from @, J ; then any a € C can be expressed as a = a; Uaz2, where a; := ane 
and a2 := alc’. Prove that the lattices £L; := {a;: a € L} satisfy the condition 
(pl,) and extend this result to the case of a center generated by a finite number of 
propositions. 


Assume that A is an algebra of observables of the form (7.18) and W is a statistical 
operator, which is reduced by all the subspaces Ha. The function pw : A¥ — (0, 1] 
defined by pw(E) := Tr(EW) satisfies the conditions (s1)-(s4) of Section 13.2. 


Any dispersionless state is pure. 
Hint: Such states satisfy p(a) =0 or p(a)=1 for any aELl. 


Prove that the result of Problem 7 remains valid if we replace the complex Hilbert 
space by a real or quaternionic space. 


Chapter 14 


Schrodinger operators 


The most important operator class in nonrelativistic quantum mechanics is repre- 
sented by Schrédinger operators on L?(JR"), i.e., operators of the form (9.2), 
La 

i= — 

eB 2m; 


j=l j 


P?+V(Q) (14.1) 


for a given measurable function V : JR" — R called the potential, which we 
have repeatedly encountered in earlier chapters. Now we want to discuss their 
properties in a more systematic manner. Even so, however, we are able provide only 
an introduction to the subject; a guide for further reading is given in the notes. 


14.1 Self—adjointness 


Since operator (14.1) typically represents the total-energy observable, it has to be 
self-adjoint, or at least essentially self-adjoint to have a unique self-adjoint ex- 
tension. This is obviously true provided the potential is essentially bounded, i.e., 
V € L“~UR"). Unfortunately, such a class is too narrow for practical purposes; it is 
sufficient to recall the electrostatic interaction of a pair of charged particles. Hence 
we must look for more general self-adjointness criteria; in this section we are going 
to discuss some widely used conditions. 


14.1.1 Remarks: (a) Though the physically most important case corresponds to 
n = 3, and operators with n = 1,2 also play a distinguished role, it is 
reasonable to consider operators (14.1) for any n. Apart from mathematical 
interest, some results obtained in this way can be useful for treatment of the 
physically interesting case of N-particle systems. 


(b) On the other hand, without loss of generality we may assume that 2m, =--- = 
2m, = 1, since otherwise we can repeat the argument we have already used 
in the proof of Theorem 9.3.1, namely we can employ the substitution ope- 


ratorelcen (UG, ....42n)= Tes (2my)?p(V2m21,...,V2mMnan) , which 
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transforms (14.1) to UHU~! = Dh, P? + V(VJ2mQ:,..., VS 2mnQn) ; all 
the conditions on the potential that we shall discuss below are invariant with 
respect to this scaling transformation. 


(c) For the sake of brevity, we shall again use P? := )of_, P? and similar short- 
ened notation for functions of the momentum and position coordinate opera- 
tors. The free Schrédinger operator corresponding to V = 0 will be denoted 
alternatively as Ho. 


First we are going to derive a useful sufficient condition based on a perturbation— 
theory argument. We shall say that a function V belongs to (L?+ L™)(UR*) if it 
can be expressed in the form of asum, V = V,+ Vo, with V, € L? and V,€ L™ ; 
speaking about functions we have in mind, as usual, classes of functions which 
coincide a.e. in JR”. 


14.1.2 Theorem: Suppose that V € (L?+ L®)(UUR") , where 
p=2 if n<3, p>5 if n>4; (14.2) 


then H := Hj+ V(Q) is self-adjoint and C§°(UR™) is a core for it. 

Proof: In view of the Kato—Rellich theorem and Problem 7.16, it is sufficient to 
check that V(Q) is Ho-bounded with the relative bound less than one. If n < 3 
and w~ € D(H), we have 


Vell < WVovdll + Voor] < [Vall Iebtlco + [|Voolloo lll 5 


where the first term on the right side is finite due to (7.16). On the other hand, if 
n >A we obtain in a similar way using the Hélder inequality 


Vdll < [Vollp lle + I Voolleoll I 


with p-'+q~! = 3. This last named condition is valid for g > 2 if n = 4 and 
2 <q < 2n/(n—4) for n > 5, so the right side is again finite — see the notes to 
Section 7.5. Hence for any positive integer n and a > 0, there is a b > 0 such that 


IVQ)bl] < alVellp How ll + (llVollp + lVoolloo) Ill 


holds for all 7 € D(Ho) ; if we choose a in such a way that allV,||, < 1, the 
assumptions of Theorem 4.3.12 are satisfied. [J 


14.1.3 Example: Suppose that the potential V is essentially bounded with the 
exception of a finite number of point singularities r,,...,2y in the vicinity of which 
it obeys the restriction |V(x)| < c;|a—aj|~* for some positive c;, a,;. If the powers 
satisfy the condition a; < min(2, n/2), the theorem may be applied. 


However, in case of Schrodinger operators in L*(JR") with n > 3 we are often 
interested in potentials, which have singularities with respect to a part of the coor- 
dinates only. The most important consequence of Theorem 2 concerns systems of 
N particles, which interact 
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(i) with an external field, 
(ii) with the other particles by means of translation—-invariant two-particle forces; 


7.€., interactions involving more than two particles are excluded. A common property 
of the corresponding potentials V is that there is a three-dimensional projection E 
on JR® such that 

V(x) = V(Ez). (14.3) 
In other words, V(x) depends on one three—dimensional vector argument only; given 
V , we can choose a basis in JR°% and a function V : IR? — R in such a way that 
Wile. .,2n) = V(a) ; in case (ii) the vector x, is associated with the relative 
position ‘of the two particles. 


14.1.4 Theorem (Kato): Suppose that V := )fL, Vk, where each of the potentials 
V, satisfies condition (14.3) and the corresponding Vz € (L?+ L©)(R’) for any 
k =1,...,m. Then the operator H :-= Hyp + V(Q) is self-adjoint and Co°(R®) 
is a core ia if. 
Proof: We use the fact that the assumption of the Kato-Rellich theorem may be 
equivalently expressed in a quadratic form — cf. Section 4.4. Given Vi. we choose 
a basis in IR°% in the way described above, and use Af (*) to denote the self-adjoint 
extension of — )>3_, 0?/Oz?, with the domain S UR®), Mimicking the argument 
from the proof of eee 2, we find that for any @ > 0 there is a G > 0 such 
that 

[[Ve(Qa)vll? < a? [LAO wll? + Bellwll? 
holds for any 7 € D(A) , where the norms are related to L?(JR°) and V(Q:) 
is shorthand for V~(Q11, Q12, Qiz) . Next we employ the decomposition L?(JR?%) = 
L? UR?) @ L?(UR?%-3) . Due to Example 5.5.8 and Theorem 5.7.2, we have V;.(Q) = 
V;i(Q1) @ I, and similarly, HO) = AW @ I. Since the last named operator is self- 
adjoint, and therefore closed, we readily see that 


VelQvol? < a7 Ho wll? + Belle? 


holds for all yw € D(AG ys The identity F3y = F3 ® F3n-3 in combination with 
(7.4) implies that H{” is unitarily equivalent to the operator of multiplication by 
hy : he(21,...,2n) = |x, |? ; it follows that D(H§”) Cc D(H) and Hi yl? < 
\|How||? for any a € D(H), and therefore 


ViCQ)v|? < a || Holl? + Bele’. 


What is important is that this estimate is independent of the choice of the coordinate 
frame in IR?" that we used to derive it. Now we denote @ := maxicecm(, and 
use the Schwarz inequality; we obtain 


IV(Q)¥l? < 2) VeCQvall [VQ < m2 (0? Howll?-+ IVI) 
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for any w € D(Hp) ; hence, to conclude the proof it is sufficient to choose a in such 
away that m7a?<1. fi 


The proved theorem shows, in particular, that the standard quantum mechanical 
description of atoms and molecules as particle systems with a pair electrostatic 
interaction is consistent. 


14.1.5 Example: Let A; be the Laplace operator corresponding to the coordinates 
Lj = (51,242,233) of the j-th particle. Then the Hamiltonian 


H: Hy = ——-Aoy Aj 
2M ° 2 » j 
(14.4) 
Z Ze2 if Z e2 
~ +> > <7, 
2 ae oe 


which describes a neutral atom with Z electrons, is e.s.a. on C§°(IR°2**) (Prob- 
lem la). The same is true for other systems consisting of a finite numbers of particles 
with Coulomb interaction: molecules, atomic and molecular ions, etc. 


Another class of potentials, for which one can prove the essential self-adjointness 
of operator (14.1), consists of functions which are below bounded; the idea is that 
in such a case, sufficiently large negative numbers will be regular values of H and 
Proposition 4.7.4b will apply. Of course, the potential has to obey some regularity 
requirements; we shall assume that it belongs to L?,.(JR"), i.e., it is square inte- 
grable on any compact subset of JR". We shall not discuss technical aspects of 
the problem (see the notes) and shall restrict ourselves to quoting the result. Be- 
fore doing so, however, we remark that a stronger result can be obtained when the 
mentioned idea is combined with the perturbation-theory argument of Theorem 2; 
this allows us to treat potentials which have some negative singularities as well. We 
define 

Va : Va(x) = + miexGEV (z), 0); 
then the following assertion is valid. 


14.1.6 Theorem: Suppose that potential V is such that V, € L2.(IR") and 
V_ € (L?+ L™)UR") for some p which satisfies condition (14.2); then the operator 
H := Ho + V(Q) is e.s.a. on CO°UR"). 


This result applies to some physically interesting potentials which are not cove- 
red by Theorems 2 and 4. 


14.1.7 Example: Heavy mesons, such as J/WV and YT, are unstable but long-living 
(in the appropriate time scale), so they are successfully modelled as nonrelativistic 
quark—antiquark systems interacting by means of a confining potential, for instance, 
V: V(r) =Ar+8-7/r, \ >0, where r := |xz2—z;| is the quark distance, 7 is the 
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squared quark charge, and \, 6 are phenomenological constants. Due to Theorem 6 
and Problem 1a, such a description is consistent. 


There are other methods for proving the self—adjointness of Schrédinger opera- 
tors. Some of them have been mentioned above; recall the application of the Nelson 
theorem to the anharmonic-oscillator Hamiltonian in Example 5.6.4, or the com- 
mutator estimate used in Example 7.2.5 — the applicability of these ideas is not 
restricted to the particular situations, in which they have been discussed. 

In addition to (14.1), we often encounter various types of generalized Schrédin- 
ger operators. One of these involves interactions which depend not only on particle 
positions but also on their momenta. A typical example is the Hamiltonian of 
spinless particles of charges q,,...,@n placed into a magnetic field described by a 
vector potential A, 


=> oe (Po + 4 AQ) + VQ), (14.5) 


j=l 


where A%)(Q) is a shorthand for the operator-valued vector with the components 
(AP ep) (a1, ..., Ln) == A,(z;)¥(21,---,2n) ; a possible electrostatic interaction is 
included into the potential V . 

For the sake of simplicity, we limit ourselves to the case of a single particle; in 
addition, we put 2m = 1 in accordance with Remark 1b and g/c = —1. Using 
canonical commutation relations we find that the Hamiltonian may then be rewritten 
in the form 


H: Hy = —Ap 4+ 2A-Vy + i(V-A)d + A? + Vb (14.6) 
provided we choose a suitable domain, say, Co°(UR®) . 


14.1.8 Theorem: Suppose that V € (L?+ L®)(R?) and the vector—potential 
components are continuously differentiable functions such that A,, V-A € L©(R3). 
Then operator (14.6) is e.s.a. on CS°(R%). 

Proof: The interaction term in (14.6) is a symmetric operator. Furthermore, it is 
easy to check that the relative bound of a sum of perturbations is less than or equal 
to the sum of the relative bounds; since the relative bound of V(Q) is zero and the 
functions V-A, A? are bounded, it is sufficient to check the operator A-P. We have 


s dxb(z) ? ae E ia Hi? 
iar = (3 pisors is) <a(3> tai?) = apni 


for any ~ € C&°(IR3) , where a? := maxis ||Aj|loo. Estimating the right side we 
obtain 


|A-P]| < a6l| Howl) + ae wll 


(Problem 2); hence we need only to choose § so that aB<1. Ff 
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14.1.9 Remarks: (a) The fact that vector potential itself is not an observable 
quantity extends the applicability of this result. It is sufficient to find a suitable 
gauge-transformed potential A’ = A— VA which satisfies the assumption of 
the theorem; then the Hamiltonian corresponding to the original A is also 
e.s.a. and its core is readily obtained from the result of Problem 10.11. 


(b) The theorem admits a substantial generalization. We shall comment on it 
briefly in the notes; it is important that essential self-adjointness can be proven 
for physically interesting situations such as an N-electron atom in a homoge- 
neous magnetic field described by A(x) := 42 A Bo for a fixed vector Bo ; 
this is the situation in which the Zeeman effect is observed. 


14.2 The minimax principle. Analytic perturba- 
tions 


Before proceeding, we want to discuss two methods which make it possible to obtain 
information about spectra by comparing different operators. Their applicability is 
not restricted to Schrédinger operators. For instance, the first method may be 
used for any below bounded self-adjoint operator; we have already encountered a 
modification of it in Corollary 3.5.10. 


14.2.1 Theorem (minimaz principle): Let H be a self-adjoint below bounded 
operator on an infinite-dimensional Hilbert space #1. We denote by D,, an arbitrary 
n-dimensional subspace in H and set 


mi(H) = gph (¥, Hy), 
wil =1 
(14.7) 
nti(H) = =1,2... 
Mn+i(H) oP epin pun (¥,Hy), n=1,2, 
vl] =1 


The sequence {y,,(H)}%2, is nondecreasing, and for any positive integer n the 
following alternative is valid: 


(i) if there is k > n such that px(H) > un(H), then H has at least n eigenvalues 
(taking their multiplicity into account) and y,,(H) is an eigenvalue of H , 


(ii) if e(47) = un(H) holds for all k > n, then H has at most n—1 eigenvalues 
(counting the multiplicity) and y,p(H) = inf o¢,,(H) . 


Proof: For brevity, we write Un := fn(H). The inequality 2 > pz, obviously holds. 
If n > 2, then D,, is uniquely determined by a subspace D,_; C D, and a vector 
@ € Dn \ Dn-1- Using the identity (Da_1 U {¢})+ = D+_, N {6}4, we obtain 


(b, Hy) 


Ln+1 = sup sup inf 
Dpa-1CH ¢€Dn-1 WE a ii U {¢})+ 
yl =1 
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> su su inf Aw) = 
Daal Bemk ve DUH) NDE, aaa 
ily] = 1 


e., the sequence {un}%, is nondecreasing. Next we as usual denote Ey := 
Ex(—oo, A] and verify the inequalities 


dimh,<n if AX<p,, dimE, >n if A> pn. (14.8) 


By Corollary 4.3.5b and Proposition 5.4.1, they are valid for n = 1 ; hence it is 
sufficient to assume n > 2. Since maaiet H is supposed to be Below bounded, 
it follows from the spectral theorem that w~ € Ran) belongs to D(H) for any 
A € IR; this fact is decisive for the rest of the proof. 

Suppose that the first of the inequalities (14.8) is not valid forsome  < py, ; then 
there is an n-dimensional subspace £ C Ran FE, C D(H) and (w, Hw) < Allyl? 
holds for all x € L. Due to prion 3, to any (n—1)-dimensional subspace D,_; 
we can find a nonzero vector wy € D-_, NL. We infer that 


(b, Hy) <A, 


inf 
yé€ D(H) ADL, 
lvl] =1 
€., Ln < »; this proves the first inequality. Assume next that the second of the 
relations (14.8) does not hold, i.e., dim Ey, <n—1 for some \ > y,. In that case 
we may choose an (n—1)—dimensional subspace D,_; D Ran F) ; this inclusion 
together with properties of the spectral measure implies 


D(A) N ae DS (S (Ran E,)+ = = Ran Ex(, oo) ' 


so (w, Hw) > Allp||? holds for any ~ € D(H) MN Dz, , and therefore pn > \ which 
contradicts the assumption; it means that the second inequality is also valid. 

Relations (14.8) and Proposition 5.4.1a show that each yu, belongs to the spec- 
trum of H , because dim Ey(Un—€,Unt+é) > dim Ey|[un—€/2,un+e/2] > 1 holds 
for any € > 0. Furthermore, the function 4 + dim £) is nondecreasing and its 
values are non-negative integers or +oo. At each A := yu, the following mutually 
exclusive possibilities arise: 


(i) dim E,,.4, < co forsome 7 > 0. Then dim Ey(u,—€, uate) < dim E,,4. < 00 
for any € € (0,7), and therefore , € oa(H) in view of Theorem 5.4.3a; in 
other words, jn is an isolated eigenvalue of a finite multiplicity and we have 
o(H) NM (Un—- 6,un+ 6) = {un} for all sufficiently small 6 > 0. The second 
of the inequalities (14.8) gives dim E,, = E,,46 =n; hence the operator H 
has at least n eigenvalues in (—0o, fn). We arrange them in ascending order, 
Ay < Ay <--:. The inequality An < un would mean dim Fy, > n but this 
contradicts the first of the inequalities (14.8), and therefore u, = A,. Since 
the multiplicity of jn is finite and the points Un41,Un42,-... belong to o(H), 
there is a k such that pp > pn. 
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(ii) dim E,,4¢ = 00 for all e > 0. Then (14.8) implies dim E,,-</2 < n—1, so 
dim Ey (iin Ey, Unt E) > dim Ete /2 — dim Eun—e/2 = 00 and [m & Cag!) 
by Theorem 5.4.3a. Assume further that  < pn belongs to Gess(H) ; then 
dim Ey (A—7, A+) < dim Eyyy <n holds for all 7 € (0, 4n—) which contra- 
dicts the assumption; this means that pn = inf oes,(H) . Next we suppose that 
Lk > Un for some k > 1; then (14.8) gives dim Evyityn)/2 < & which is again 
impossible. Finally, assume that there are eigenvalues A; < --* < An < Un; 
then dim E(in+d,)/2 >, which is once more in contradiction with the first of 
the inequalities (14.8). This concludes the proof. § 


14.2.2 Remarks: (a) The theorem also holds, of course, in the case dim H < oo, 
where we may drop the domain D(#) in the definition of u»n(H). The same is 
true for any bounded H on an infinite-dimensional 71. However, there is an 
important difference. The theorem describes only that part of the spectrum 
which is below info.se(H) (or above supde.s(H) ; if H is bounded from 
above, we can apply the result to the operator —H). In the first case oes.(H) 
is necessarily empty, so the theorem describes the whole spectrum; this is not 
true in general if dim#{ = co (cf. Problem 4). 


(b) The relations (14.7) can be written in a unified manner if we set Do := {0}. 
Suppose further that A; <--- < Angi <--: are the eigenvalues of H situated 
below the bottom of the essential spectrum (with the multiplicity taken into 
account), and w,...,Wn41,--- are the corresponding normalized eigenvectors. 
Substituting D, := {W,...,%n}un and W := Wn41 into the relations (14.7), 
we see that the supremum and infimum may be replaced by the maximum and 
minimum, respectively; hence the name of the theorem. 


(c) The relations (14.7) allow various modifications. For instance, we have 


ee sup inf . (y, Hy), 
$14--5¢n—-1EH Y = {¢1, oes ,n-1} 
¥ € D(H), |p|] =1 
where the vectors ¢),...,¢n, are not supposed to be linearly independent, or 
7) = 90 inf teed 
fae Danich ¥ Dea natH HY), 
Iv] =1 


where Q(H) is the form domain of operator H (Problem 5). 


As a simple consequence of the minimax principle, let us mention the coupling- 
constant dependence of eigenvalues. 


14.2.3 Proposition: Let Ho, V be self-adjoint operators and Hp > 0. Suppose 
further that V is Ho-bounded with zero relative bound, and that the operator 
Hy, := HotgV satisfies oess(H,) = IR* for all g > 0. Then for any n, the function 
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9 ++ [m(H,) is continuous and nonincreasing on [0,00) ; moreover, it is strictly 
monotonic at the points where jin(H,) <0. 

Proof: In view of the Kato-Rellich theorem, H, is self-adjoint and D(H,) = D(Ho) 
for all g > 0. The numbers y,(H,) are nonpositive by assumption; hence 


bn( Hg) = a want gen Oe (b, Ho) + 9(y, VY) } 


irvil =1 
Since (w~, How) > 0, we readily check that g + min{0, (W, How) + g(v, Vy)} is 
nonincreasing in JR*+ for any ~ € D(Hpo) ; the same is then also true for the 
function g++ u,,(H,). The continuity follows from the analytic perturbation theory 
for isolated eigenvalues — see Theorem 7 below. 

Assume further that pn(H,) < 0 and denote by qW,...,Wn the normalized 
eigenvectors of H, corresponding to the lowest n eigenvalues (counting the mul- 
tiplicity). We have (¥,Vw~) < 0 for any w € L := {Wy,...,Un}tin, because 
(w, How) > 0 holds by assumption. The subspace L is finite-dimensional; hence 
the unit sphere S:= {we L: |\w|| = 1} is compact, and since ~ / (w, Vw) 
is continuous on S, Corollary 1.3.7c implies existence of a positive c such that 
(w,Vw) < —c holds for any w € S. In view of Problem 3, we can find to any 
Dn-1 C H aunit y € D41_, NL; this vector satisfies 


for any 6 > 0. At the same time, it is not difficult to check that (~, Hj) < 
(bn, Hy,) = — ) holds for each w € L, and therefore 


meee, — (, Hots) S ee (), Hgts) < Un( Hy) — 06 
foil Su ae ay 
fomeny D,.;G 71, 1.¢., pn(Hg4s) < pn(H,). OB 


14.2.4 Example: Consider the operator H, := Ho+gV(Q) on L?(R") with a 
potential V € L?(UR"), where p satisfies condition (14.2). By Theorem 14.1.2, 
V(Q) is Ho-bounded with zero relative bound; in Section 14.4 we shall show that 
Oess(Hy) = Oess(Ho) = IR* holds for any g € IR. Proposition 3 may then be ap- 
plied either to the operators H, with g € (0,00), or to H, = Ho— g(—V) with 
g € (—co, 0]. We are interested, of course, in the nontrivial situation when H, has 
isolated (1.e., negative) eigenvalues of a finite multiplicity — in dependence on the 
potential this may occur in both cases, in one of them, or not at all. Proposition 3 
implies that the number of eigenvalues (counting the multiplicity) is nondecreas- 
ing as a function of |g|, and the bottom of the spectrum i(H,) = info(H,) is 
continuously decreasing with respect to |g| provided there is at least one negative 
eigenvalue. This conclusion extends to potentials V € (17+ L2°)(IR™) which we 
shall discuss below (cf. Problem 1). 


We know from Section 5.4 that if H is a below bounded self-adjoint operator 
and w~ is a unit vector from D(H), then (~,Hw) > info(H). The minimax 
principle allows us to generalize this result substantially. 
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14.2.5 Proposition: Assume that H is self-adjoint and L is an N-dimensional 
subspace in D(H) ; we denote the corresponding projection as P and set Hy := 
PHP. Let \P <---< AK, be the eigenvalues of the operator H,} L ; then 


pall) <a tema lige RR 


In particular, if og(H) consists of the eigenvalues A; < --: < Am (counting the 
multiplicity), then A, <A? holds for n= 1,...,min(m,N). 

Proof: The operator H,} L is obviously Hermitean and satisfies (W, Hiw) = (W, Hv) 
for y% € L; then Theorem 1 gives 


P= su in , Hy) = sup inf (b, Hy) 
e sail yeDi_, OL wv ¥) Dn-1CH y €(PDp-1)t NL 
Il] = 1 i] = 1 


(cf. Remark 2c). Furthermore, using the fact that (~,P@) = (~,¢) holds for 
aw € L, we obtain 


\P = sup inf (b,Hw) > sup inf (b, Hw) = pra(H). Of 
Dageh WERE nb Drege sek eed 
lol] = 1 “Wb =1 


The obtained result makes it possible to estimate the eigenvalues of H from 
above by finite-dimensional approximations; this is the basic idea of the so-called 
Rayletgh-Ritz method, which is widely used in practical calculations. 

The second topic we want to speak about here is the perturbation theory. This 
term covers an extremely wide range of problems. A few of them have been discussed 
above; let us recall 


(a) the stability of self-adjointness with respect to relatively bounded perturbations 
which forms the content of Theorems 4.3.12 and 4.6.14, 


(b) the stability of o.,, under relatively compact perturbations — Theorem 5.4.6, 
(c) time-dependent perturbations — cf. Example 9.5.5 and the notes to Section 9.5, 
(d) perturbations of embedded eigenvalues — see Section 9.6 and the notes to it. 


However, for a quantum physicist the term is associated primarily with perturbations 
of isolated eigenvalues. It is a vast subject which has not been mentioned up to now; 
we limit ourselves to an illustration of the ideas on a particular class of operators. 
We shall need the following result, which is not difficult to prove for self-adjoint 
operators (Problem 6); for the general case see the notes. 


14.2.6 Theorem: Let T be a closed operator and z an isolated point of o(T). 
Suppose that I’ C p(T) is a closed curve enclosing the point zp clockwise, and 
define ' 
— a [leiden 

21 I r(2) 
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Then operator P is a projection, in general nonorthogonal, and its range is an 
invariant subspace of T. If RanP is one-dimensional, it is an eigenspace of T 
corresponding to the eigenvalue z ; more generally, if dim Ran P < oo, then the 
identity (T—z)"~ = 0 holds for each y € Ran P and a positive integer n. 


Any perturbation problem concerns a class of operators depending on a pa- 
rameter (or parameters); the simplest situation corresponds to the case when this 
dependence is linear. We shall again consider relatively bounded perturbations. 


14.2.7 Theorem: Suppose that Ho, V are self-adjoint and V is Hp—bounded; we 
set H, := Ho+ QV forall ge. 


‘(a) Let z € p(Hp) ; then there is a complex neighborhood U of the point g = 0 
such that the operator—valued function g++ (H,— z)~' is analytic in U. 


(b) Let Ao be a simple isolated eigenvalue of Ho with an eigenvector w%. Then 
for all sufficiently small |g|, the operator H, has a simple eigenvalue (9) ; 
the function X(-) is analytic in the vicinity of g =0 and A(0) = Ao. 


Proof: Suppose that the operators Ho, V satisfy the inequality analogous to (4.3). 
Mimicking the argument from the proof of Theorem 4.3.12, we find 


I|V(Ho— z)~*|| < al] Ho(Ho— z)~" || + 6|(Ho— z)~"I 


for any z € p(Hp), so the operator on the left side is bounded; now we do not 
care about the value of a because the perturbation V enters H, with the coupling 
constant g. By Lemma 1.7.3 and Theorem 1.7.5, the operators (I+gV(Ho—z)7!)7! 
then exist for small enough |g] and g++ (1 + gV(Ho— z)~1)~ is analytic in the 
vicinity of g = 0. The last named property is preserved when the function is 
multiplied by a bounded g-—independent operator; furthermore, by Problem 1.63 we 
have z € p(H,) and 


(Hy— z)7! = (Ho— z)"! (1+. gV(Ho— 2)7)7? (14.9) 


for sufficiently small |g| ; this proves assertion (a). 

Next we use Theorem 6. We set T'.(Ao) := {z= Zp = Aotee* : yw € (0,27) }. 
Since Ao is isolated due to the assumption, we can achieve [.(A9) € p(Ho) by 
choosing € small enough. Then 


se ar 
P(g) := L : ae gV — z) ‘dz (14.10) 


2171 
makes sense if g = 0 ; we want to check that this remains true in some neighborhood 
of this point. By the already proven first part, g + (H,—z,)~' is analytic in an 
open set U, containing g = 0; hence it is sufficient to check that U := \ocyeon Uy 
is open. The resolvent (H,—z)~' is expressed by a power series whose radius of 
convergence is go := ||V(Ho— z)~’||"’. Using Example 4.3.6 we readily find that 
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\|(Ho— z)~4|| < e7! and ||Ho(Ho— z)7~!|| < 2+ |Aole7! provided € is chosen small 
enough; this gives the inequality 


al E 

90 2 [|| Ho(Ho— z)~"|| + 6||(Ho- 2)" 2 a(ol + 22) 28" 

which shows that the radius of U is positive. It now readily follows from Theo- 
rem A.5.3 that the operator—valued function P(-) is analytic around g = 0. 

This means, in particular, that it is continuous; then ||P(-)|| is also continuous 
and since it assumes integer values, it is constant in the vicinity of g = 0. Hence 
dim Ran P(g) = 1 so, by Theorem 6, for |g| sufficiently small, H, has a simple 
eigenvalue A(g) and P(g)yo is the corresponding eigenvector. The latter is not 
normalized, in general, but its norm is a continuous function of g ; this means that 
it is nonzero around g = 0 and we may introduce w(g) := ||P(g)¥oll"'P(g)vo- 
Finally, we have (H,— z)~'#(g) = (A(g)—z)7'(g) , and therefore 


(Ho, (Hy— 2)" P(g) Yo 
(vo, P(g)%o) 


for z € I, (Ao) ; the left side is analytic with respect to g so the same is true for 
the right side ; then the function X(-) is also analytic in the vicinity of the point 
g=0: EF 


In this way one is able not only to prove the existence of the perturbed eigen- 
values and their analyticity with respect to the parameter (coupling constant), but 
also to compute the coefficients of the appropriate Taylor expansion, which is usually 
named the Rayleigh-Schrédinger series. The starting point is the operator P(g) ; 
note that we are usually interested in the real coupling constant, in which case H, 
is self-adjoint and by Problem 6, P(g) is an orthogonal projection (a spectral pro- 
jection of the perturbed operator). Writing the last operator on the right side of 
(14.9) as a geometric series and substituting into (14.10), we obtain 


= (X(g)-2)"" 


Bil) = = yo (-a)" f (Ho- 2)" [V(Ho= 2)-"]" dz. 


(Ao) 


This formula determines the perturbed eigenvalue by (9g) = (w(g), (Ho+9V)v(Q)) ; 
after the necessary algebra we not only recover the first few terms of the Rayleigh- 
Schrodinger series known from quantum mechanics textbooks, but can also obtain 
an expression for the general term — see the references mentioned in the notes. 


14.3 The discrete spectrum 


In quantum mechanics we associate eigenvectors of the operators (14.1) with bound 
states of the corresponding system of particles; a motivation for this interpretation 
will be mentioned in Section 15.1 below. The most important are isolated eigenvalues 
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of a finite multiplicity, i.e., points of the discrete spectrum. In this section we are 
going to discuss some of their properties. 

The first question to be asked concerns the number of bound states, which is 
understood as the dimension of the spectral projection corresponding to oq(H). 
We denote this quantity for operator (14.1) with a potential V as N(V), and 
present some results which allow us to estimate N(V) from above. 

Let us begin with the case of a centrally symmetric potential which we discussed 
in Section 11.5, V(x) = u(r) for some V : IR+t — RR. To check the essential self-— 
adjointness of H := Hp+ V(Q), we have to prove this property for the partial-wave 
operators (11.19). In fact, in most cases only the s-wave operator ho is important 
(cf. Problem 7a); on the other hand, one has to choose properly the boundary 
condition at the origin which specifies the self-adjoint extension ho (Problem 7b). 

Let ne(V) be the number of bound states of the operator he. Since dimG, = 
2€+1, the decomposition UL?(IR?) = OF, L?(UR*) ® Ge of Section 11.5 gives 


Cc 
N(V) = >> (2@+1)n(V). (14.11) 

£=0 
Hence it is sufficient to find the numbers ne(V) of “partial-wave” bound states; 
in this way the problem is reduced to analysis of ordinary differential operators. 


We shall quote a typical result for potentials of the class L?+ L& introduced in 
Problem 1. 


14.3.1 Theorem (GMGT bound): Let V € (L?+L%)(IR®) be a centrally symmetric 
potential, V(x) = u(r) ; then the number of “partial-wave” bound states of operator 
(14.1) with 2m = 1 satisfies the inequality 


ne(V) < ae (20+1)-@P-) ie r?P—lay(r)|? dr (14.12) 


for any p> 1. 


14.3.2 Remarks: (a) The bound for a general mass is obtained easily using the 
scaling transformation of Remark 14.1.1b. In the particular case of p= 1 the 
inequality (14.12) yields the classical Bargmann bound 


it 00 
< — F 
n(V) < Pal [ r|v(r)| dr 


(b) By Theorem 14.1.2 and Problem 4.22, operator H is below bounded under the 
assumptions of the theorem. The minimax principle then allows us to replace 
the potential by its negative part. We have, for instance, 


il {o's} 
n(V) < ne(V_) < a1 il r|v_(r)| dr , 


and and analogous relations for p > 1 ; similar upper bounds exist for the 
number of bound states with energies smaller than a given value (Problem 8). 
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(c) The bound shows, in particular, that operator (14.1) on L?(JR*) with a po- 
tential which is purely attractive, i.e., assuming nonpositive values only, may 
have no bound state at all if the interaction is weak enough. For example, the 
Bargmann bound shows that this is the case if 


is rlu(r)|dr < 1. 


A similar conclusion can be made for Schrédinger operators on L?(JR”) with 
n > 3; in contrast, we shall see below that this cannot happen for n = 1,2. 


Let us turn to potentials without a rotational symmetry. Since our aim is pri- 
marily to illustrate the methods, in the proofs we shall impose rather strong reg- 
ularity assumptions on the potentials; in the notes we indicate how they can be 
weakened. We begin with the following argument: if the Schrédinger operator 
H, := Ho+gV(Q) with a potential V ¢ L°(UR”) has a negative eigenvalue 2, then 
the corresponding eigenvector satisfies the identity ~ = —g(Ho— A)~'V(Q)w. We 
denote V(Q)!/? := |V(Q)|'/*sgn V(Q) and ¢:= |V(Q)|}/w ; then the last relation 
can be rewritten as 


¢ = —g|V(Q)|/?(Ho— A)“1V(Q)76. 


On the other hand, suppose that a vector ¢ € L*(JR") solves the last equation. 
The operator V(Q)!/? is bounded by assumption, so V(Q)!/2¢ € L?(IR") and the 
vector  := —g(Ho— A)~!V(Q)'/2 belongs to D(Ho) and ¢= |V(Q)|!/2y, ie., 


~ = —g(Ho— \)7'V(Q)'?|V(Q) [2b = —9(Ho— A)7'V(Q)w. 


Furthermore, Problem 1.63 gives I + g(Ho— \)~'V(Q) = (Ho— A)~)(Hy— 2), and 
therefore (Ho— »)~'(H,— A)w = 0 ; however,  ¢ o(Ho) in view of (7.14), which 
means that w is an eigenvector of H, with the eigenvalue . In this way, we have 
obtained the following useful result. 


14.3.3 Lemma (Birman: Schwinger principle): Let V € L®(IR") ; then —«? is an 
eigenvalue of H, := Ho+ gV(Q) for some k > 0 iff the operator 


Ky = |V(Q)|'?(Ho+ «?)-'V(Q)? (14.13) 
has an eigenvalue —g~!. 


14.3.4 Remark: Once we drop the assumption V € L®(JR") in the lemma, the 
vector |V(Q)|'/2~) may not belong to L?, in which case the right side of (14.13) 
has no meaning. However, the resolvent (Ho+4?)~! is explicitly known from Prob- 
lem 7.18; this allows us to define K, as an integral operator and extend the validity 
of the lemma to a much wider class of potentials (see the notes). A similar integral- 
operator representation of |V(Q)|!/2(Ho— z)~!V(Q)'? exists for any z € (\ Rt. 
~ more general context, we shall encounter this problem in Proposition 15.1.3 
elow. 
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Using Lemma 3, one can prove a bound on N(V) for potentials of the Rollnik 
class A introduced in the notes to Section 14.1. The Schrédinger operator must 
then in general be defined as the form sum Hy+V(Q) ; however, if the corresponding 
operator sum is e.s.a., then the two definitions coincide — see Remark 4.6.12. 


14.3.5 Theorem (Birman-Schwinger bound): Consider the Schrodinger operator 
H := Ho+V(Q) on L?(UR%) with a potential V € R. The number N(V) of bound 
states is finite and satisfies the inequality 


1 IV (x) IV y)| 
NV) $ 5 18 ea ge ee: (14.14) 


Proof will be given for V € C®°(JR®) ; for the general case see the notes. As in 
Remark 2b, in view of the minimax principle we may restrict ourselves to the case 
when V(x) < 0 holds for all 2 € R?. Let Hy := Hot gV(Q), i.e, H= AM; 
then another consequence of this principle contained in Proposition 14.2.3 tells us 
that the values u,(H,) are continuous and nondecreasing functions of the coupling 
constant g. We consider N,(V) defined in Problem 8 for \ := —K? < 0. This 
quantity is obviously equal to the maximal n for which ppz(H1) < 4; it follows from 
the continuity of g ++ u,(H,) that for any such n there is a number g, € (0,1) 
such that y,(H,,) = 4. These numbers satisfy the inequalities 0 < g) < g. <-::; 
for the other n with y,(H,) > » the numbers g, may exist but g, > 1. This 
means that the quantity N)(V) is at the same time equal to the maximal n for 
which g, < 1; this yields the inequality 


Nx(V) oo 
NY). < >> ox? = Soe: 
n=1 n=1 


Next we use Lemma 3 according to which H,, has an eigenvalue \ iff —g;7! is an 
eigenvalue of the operator (14.13); this implies 


N,(V) < > ae 
n=] 
where Kn, n = 1,2,..., are the eigenvalues of K,. By Remark 4, the latter is an 
integral operator with the kernel 


_ V@) pe aie) 2 


K,(x, y) cae An|x —_ y| 


which is real and symmetric, t.e., A, is Hermitean. Moreover, it follows from the 
assumption V € C&C R that K,(-,-) € L?UR®), which means that the operator 
K,, is Hilbert-Schmidt. Hence we may use Proposition 3.6.6 and Remark 3.6.3 to 
estimate 


. 1 VOM) —acie- 
N\(V) < Tr(KiK.) = Téq2 ie cae ae] WI dx dy. (14.15) 
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To conclude the proof; it is sufficient to express N(V) = limyo- N,(V) by means 
of the dominated-convergence theorem. [§ 


As another illustration of the Birman-Schwinger principle, let us prove the claim 
made in Remark 2c. 


14.3.6 Theorem: Let H, := Hot+gV(Q) be the Schrédinger operator on L?(JR”) 
with a potential V € CS°(IR") , which is nonzero and purely attractive. If n= 1,2, 
then H, has at least one bound state for g > 0. 

Proof: In view of Proposition 14.2.3 we have to check that 1(H,) < 0 holds for 
all g > 0. As in the preceding proof, a number g; = g:(K) such that i(Hy,) = » 
corresponds to any \ = —K? <0, and by Lemma 3 this is equivalent to the fact that 
—g;' is an eigenvalue of K,. The mentioned requirement will then be satisfied if 
for any g > 0 there is a « > 0 such that g;(«) < g, i.e., that the corresponding 
—K,, has an eigenvalue g,(k)~! > g~!. Since the potential is of Cg° and purely 
attractive, —K, is for any « > 0 a positive Hilbert-Schmidt operator; hence it has 
a purely discrete spectrum and its norm equals its maximal eigenvalue. It is then 
sufficient to show that lim,_.04. ||Kx|| = 00, i.e., to find a vector ¢ € L?(JR") such 
that the relation 


Jim, (6, 1V(Q)? Hot 2)" 1V(Q)74) = 00 (14.16) 


is satisfied; this can easily be achieved if n= 1,2 (Problem 9a). & 
We shall mention one more upper bound (see the notes). 


14.3.7 Theorem (Cwikel-Lieb-Rosenblium): Assume that the operator (14.1) on 
2"), n> 3, with Im; = 1, j= 1,...9m, and V € L*/GR")sis e:s.e. Then 
there is a number c, such that 


N(V) < Ga [. \V_(x)|"/? dar. (14.17) 


14.3.8 Remark: One can not only establish the existence of the constants c, , but 
also to find various estimates for their values, for instance, 


In the case 3 <n <7 we have c® > c@ and the Lieb-Thirring conjecture claims 


that the first inequality provides an optimal lower bound. In the particular case 
n = 3 we also have an upper bound, 


@ > := 


cs = 0.0780 < cs < 0.1156. 


On the other hand, for n > 8 we have c@ > c8 ; there are examples showing that, 


n 


in general, the bound c, > c@ is not saturated. 
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Let us now ask more generally under which circumstances the discrete spectrum 
is finite or infinite. The above discussed bounds suggest that the answer may depend 
on the behavior of the function V at large distances. In the three—dimensional case, 
we have the following result. 


14.3.9 Theorem: Let H := Ho+V(Q) be the operator.(14.1) on L?(R%) with a 
potential V € (Z?+ L&)(R). 


(a) If there are c € [0, +) and ro > 0 such that V(x) > —cr~? holds for any 
r:= |z| > 10, then og(H) is finite. 


(b) If there are positive ro,d,eé such that V(r) < —dr7?** holds for all r > 19, 
then o4({#) is infinite. 


Proof: In the next section we shall prove that oess(H) = [0,00) ; hence we have 
to find the number of negative eigenvalues (including their multiplicities). Had 
the potential decayed slightly faster at infinity, say V(x) > —ce-?-* for some 
€ > 0, assertion (a) would follow from Theorem 7. Under the present assumption, 
the argument is more complicated. We define V(x) := V(z) +cr~?. Since any 
w € CSUR’) belongs to D(Q™!) , we may use the second of the inequalities (8.4), 
which gives 


|, B@) (-Av - Fv) @ade = (, Hod) - FIQ¥IP > 0, 
and therefore 
(WH) = [B® (-A-40) Ay + Vw + 4c(-A- i)v) (x) dr 
> (1—4c) ae, (a) (-Ad + (1-4e)"'Vab) (x) dx > (1—4c) (W, Hew), 


where we have denoted H, := Ho+ (1—4c)~!V_(Q). Since |V_| < |V_| < |V|, 
the function V_ belongs to L?+ L™, and the same is true for (1—4c)~!V_. By 
Theorem 14.1.2, the operators H and H, are then self-adjoint with the common 
domain D(H) and C%°(JR*) is a core for both of them — cf. Problem 7.16. Using 
the fact that V(Q) and (1 —4c)~1V_(Q) are Ho-bounded, we readily check that 
the above inequality can be extended to any ~ € D(Ho) (Problem 10a). The 
minimax principle then implies 


fin(H) > (1—4c) un(He), n=1,2,..., 


and since o¢s,(H.) = [0,00), it is sufficient to check that the operator H, has a 
finite number of bound states. By assumption, the function (1—4c)~!V_ has a 
compact support, so it belongs to L?(IR), p € [1,2] (Problem 10b). Assertion 
(a) now follows from Theorem 7, or alternatively, from Theorem 5 (see the notes to 
Sec.14.1). 
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To prove (b) we again employ the fact that gess(H) = [0,00) ; in view of the 
minimax principle, it is sufficient to show that ,(H) <0 for any positive integer 
n. We shall employ Proposition 14.2.5, due to which we have to find to any N 
an N-dimensional subspace L C D(H) such that the projected operator PH } L 
has negative eigenvalues. We shall emulate the scaling argument of Example 7.5.8, 
taking da: ¢a(z) = a~9/?6(x/a@) for a unit vector da € C§°(IR*) , which we choose 
in such a way that supp¢ C {x: 1 <r <2}. If a >r, the hypothesis of part (b) 
implies 

($a, Hba) < a ?(¢, Hod) — dam***(g, Q-***9) ; 
hence there is an r; > 0 such that (¢2,H¢a) < 0 holds for a > r,. The sought 
subspace may be now constructed by means of the vectors ¢.. Putting Qn ‘= 
2"r, and wn := da, , we have (wn,Hyn) < 0 and (Yn, Hy) = 0 fork An 
since supp %, M supp ~, = @. Thus we choose L := {y,..., Ww }un for which the 
mentioned proposition yields 


N N 
pall gs sup 3 | Brel brag TF lip) cae Sub | = SUP (Un Hn) < 0 
n=1 n=1 gn 


for all n= 1,...,N ; this concludes the proof. § 


14.3.10 Example: The Coulomb potential V : V(r) = —y/r, y := Ze?, belongs 
to (L?+ L&)(IR?) and satisfies the assumption of part (b) for all r large enough; 
hence hydrogen-type atoms have an infinite discrete spectrum. This is not surpris- 
ing, of course, because the spectrum is known explicitly in this case: the eigenvalues 
are A, = —my?/2n? with the multiplicities n? , and we also know the corresponding 
eigenfunctions nem - 


The situation is less simple in higher dimensions, where we are interested pri- 
marily in the Schrédinger operators which play the role of Hamiltonians of atoms 
and molecules. We shall discuss the example of helium-type atoms; more compli- 
cated systems will be mentioned in the notes. If we neglect the spins and regard 
the nucleus as infinitely heavy, then we associate with a two-electron atom the 
Hamiltonian 


1 - = = 
Haga (P2 +P?) — Ze? (IQi\-!+ |Qa\-') + e71Qi- Qal™ 

on L?(IR®°), where Ze is the nucleus charge; we again use the notation Q,; := 
(Q31,Qj2,Q53), etc. To simplify the problem, we use the unitary “substitution” 
operator U : (Uw)(x1, 22) = (mZe?)->y ea Ba), which transforms H into 
UHU- = } m(Ze?)*H,, , where 


Hy = PR + Pz - 24Qi\7* = 21Q2\- + y1Qi— Qal7 (14.18) 


and + := 2/Z is a positive parameter. The operators H., are self-adjoint by Theo- 
rem 14.1.4; the physically interesting cases are 7 = 2, 1, 2/3, 1/2,... corresponding 
to H~, He, Lit, Bet*, etc., respectively. 
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14.3.11 Theorem (Kato): oa(Hy) is infinite if y < 2. 

Proof: By Examples 10 and 5.7.5, oess(H) = [—1,00) for y =0; the same is also 
true for a nonzero y due to the HVZ-theorem, which we shall discuss below. By 
analogy with the proof of Theorem 9b, it is then sufficient to find to any positive 
integer N an N-dimensional subspace L C D(H.) such that the eigenvalues of 
the “cut-off” operator are less than —1, i.e., 


sup{ (v, Hy) : PEL, |p|] =1} < —1. (14.19) 
Let woo: Wioo(x) = 7~/2e-" be the wave function of the hydrogen-atom ground 
state in the used system of units, (P?— 2|Q,|~!)wi00 = —wioo ; then we have 


(W100 ® , Hy(Yi00 ® ¢)) = —-1 + (4, (P?- (2—7)|Q2|7") $) 


+ ¥ (W100 ® ¢, (|Q1— Qz]~*— |Qal-) (vr00 ® 4)) 


for any unit vector ¢ € S(UR*). Using the fact that the function ~199 is rotationally 
invariant, we obtain 


(W100 ® 4, 1Q1— Qal- "(roo ® 4)) 
e ” dz |$(z2)|’ he da, min(ry*, rz *) [br00(x1)|? 


< ie rz '|b(z2) |? dz2 < (v00 ® , |Q2|-*(Yr00 ® ¢)) 


(Problem 11), so the last term in the above identity is nonpositive. In view of the 
preceding example the operator appearing in the second term has the eigenvalues 
An = —(2—7)?/4n?, n = 1,2,..., with the multiplicities n?. Let {Ey}, be 
the sequence obtained by ordering these eigenvalues (with repetition according to 
the multiplicity); the corresponding sequence of mutually orthogonal eigenfunctions 
will be denoted as {¢%}?2, (they differ from hydrogen-atom eigenfunctions by the 
above mentioned scaling transformation only, and therefore they belong to S(JR?) ). 
We define L := {y100 ® 1,.--, Vio ®@ On Sin ; then 


N 
(b, Hy) < —14+ > |oe|? Ex < —-l 


[=i 


holds for any wy = ihe axnW100 ® de, which satisfies the normalization condition 
~, |ax|? = 1, because all the eigenvalues EF, are negative. Using further the 
compactness of the unit sphere in L in the same way as in the proof of Proposi- 
tion 14.2.3, we arrive at the sought inequality (14.19). I 


Of course, finding the cardinality of the discrete spectrum is just a part of its 
analysis; one also has to study the distribution of eigenvalues, the behavior of the 
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eigenfunctions, etc. Such a task goes beyond the scope and purpose of the present 
book; the reader can find a lot of information on these subjects in the literature 
quoted in the notes. In conclusion, let us mention one often used result. 


14.3.12 Theorem: Let H := Ho+V(Q) be the operator (14.1) on L?(JR*) with a 
potential V € (L?+ L®)(JR°). Suppose that Ao := inf o(H) is an eigenvalue; then 
its multiplicity is one and the corresponding eigenvector Wo can be (with a suitable 
choice of phase factor) represented by a function which is everywhere positive. 


The lowest eigenvalue of a Hamiltonian is conventionally called the ground state; 
the theorem says that it is nondegenerate and the corresponding wave function has 
no nodes. This is true also many other Schrodinger operators (see the notes). 


14.4 The essential spectrum 


The investigation of the essential spectrum of Schrédinger operators again involves 
many problems; we shall mention just a few of them. Let us begin with the ques- 
tion of stability. The classical Weyl theorem of Problem 5.26 does not apply to 
Schrédinger operators with the exception of the trivial case, because the operators 
of multiplication by a function are not compact. Fortunately, for a wide class of 
potentials they are Hp—compact. 


14.4.1 Theorem: Let H := Ho+ V(Q) be the operator (14.1) with a potential 
V € (L?+ L&)UR"), n <3; then o¢.5(H) = (0,00). 

Proof: In view of Theorem 5.4.6 and Example 7.5.8, it is sufficient to check that 
the operator V(Q)(Ho+ J)! is compact. If V € L*, this follows from Proposi- 
tion 15.1.3a and the fact that g : g(p) = (p?+1)7! is square integrable. Furthermore, 
for any positive integer n there is a V, € L? such that ||V—Vallo <n! ; hence 


I(V—Va)(Ho+ 1)? =f 1V()—Vale)PU((Ho+ Nv) @)/Pd 


IA 


IV —Valleoll(Ho+ 1)*wl)? < 7? III)? 


holds for any w € L?(JR") due to Example 4.3.6. This means that the sequence 
{Vn(Q)(Ho+1)~'} converges to V(Q)(Ho+/)~! with respect to the operator norm, 
so the result follows from Theorem 3.5.2. If 


The result extends to Schrédinger operators on L?(JR"), n > 4, with potentials 
V e€ (L?+ L?)UR"), where p satisfies the condition (14.2), and other situations 
(see the notes). This assumption is valid, e.g., if V € LR. and lima. Vile) =0 
(Problem 1b). In the case that the limit exists and assumes a nonzero value c we 
may apply the result to the operator with the shifted potential V—c, obtaining 
Gess(H) = [c,0o). This suggests that the spectrum of Schrédinger operators with 
potentials, which grow at large distances, is purely discrete. 
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14.4.2 Theorem: Let H be the operator (14.1) on L?(R") with a potential 
V € L3.., which is below bounded and satisfies the condition limjz|-400 V(x) = 00 ; 
then o.,,(H) = 90. 
Proof: The operator H = Hj+V(Q) is symmetric and below bounded, and sin- 
ce Dy = D(V(Q)) D CSUR"), it is also e.s.a. due to Theorem 14.1.6. With- 
out loss of generality we may assume that V > 0 ; then we define the form 
s: sp] = ||Ho/7 bl? + ||V'/2wl|?. with the domain D(Ho)N Dy. It is closable 
by Proposition 4.6.7, and the self-adjoint operator associated with 5, i.e., the 
Friedrichs extension of H , coincides with H , because the latter is essentially self- 
adjoint. Due to Theorem 4.6.5b, for any w € D(s) := Q(A) there is a sequence 
{Wn} C D(A) such that vn > W and sl[¥n— yn] — 0 as n,m — oo. Since s isa 
sum of two positive forms, it follows that He’? (a— Wm)||? + 0 as n,m — oo, i.e., 
pe D(H’) = Q(Ap) . In this way, we have obtained the inclusion Q(H) C Q(Ap). 
Now we may use the minimax principle; it is sufficient to check that fin(H) — oo 
as n — oo. By assumption, for any b> 0 there is an r, > 0 such that V(x) >b 
holds for |z| > r,. The function 


~ nee z|>r 
Vs Wile) = | 0 Hos 


is bounded; hence the corresponding Schrodinger operator H, := Hot+ V,(Q) has the 
domain D(Hp), and similarly Q(Hy) = Q(Ho). It now follows from the inequality 
V(z) > b+ V(x) and Theorem 14.2.1 that 


Sil} = ‘st inf 8 > sup inf H,+ b)¥/2yI|2 
HoH) = sup pit ay MS LP, yeni gan lor oyu 
na? (l= 1 
os ue Hy+ b)"?wl|? = pn(Ho-+b) = b+ pn(Hb). 
_ Dee b € DL_, NQ(Ho) e+ 6) vl Un(Hp+ 6) Un( Ap) 
Web|] = 1 


The potential V, obviously belongs to L"/?(UR") , so it has a finite number of bound 
states due to Theorem 14.3.7. Hence there is an np such that un(Hy) > 0 holds for 
all n > ny, and therefore yn(H) > b for all n > np; this concludes the proof. 


The condition given by the last theorem is sufficient but not necessary. In the 
one-dimensional case, we have the following simple criterion (see the notes). 


14.4.3 Theorem (Moléanov): Let H be the operator (14.1) on L?(UR) with a 
potential V € L?..UR) which is below bounded. Then o-;,(H) = @ holds iff the 
conditions 


ats 
im Vig)dz = a 


are valid for any 6 >0. 


Let us now return to operators with a nonempty essential spectrum. Consider 


464 CHAPTER 14. SCHRODINGER OPERATORS 


the Hamiltonian of a system of N particles interacting via two-particle forces, 
a 


N 
oe PP 3” Y VinlQi— On). (14.20) 
j= 


mS. 


= 
i 
I 2 


In this case the 3N-dimensional version of Theorem 1 is of little use, because 
even if the full potential approaches a limit at large distances, the latter generally 
depends on the chosen direction; the bundle of lines z;— x, = const, along which 
this term is constant, corresponds to each two-particle potential V;, . The way to 
determine the essential spectrum of H is thus more complicated; however, it has a 
transparent physical meaning. At the same time, it is technically complicated, so 
we limit ourselves to the formulation of the result. 

First we shall ask how the system under consideration, which we denote as S, 
can be decomposed into subsystems. We assume N > 2. However, for N = 2 the 
following argument reduces to the comparison of the center-of-mass Hamiltonian 
from the decomposition (11.12) to the Hamiltonian of the free relative motion. If 
N > 3, there are different ways to decompose the system (four possibilities exist for 
N = 3, fourteen for N = 4, etc.). Let D:= {C),...,Cagpy} be such a partition 
of S into clusters of particles; we exclude the trivial case with n(D) = 1. We 
danate. I := {1,...,N} and define the function «? : I x I — {0,1} in such * way 
that c= mi. if éiie j-th and k-th particle belong to the same cluster and i= = 
otherwise. Then we can define the operator 


Hope >) PMS 3 © PVn(Q;— Qk) 


j=l k=1 


with a clear physical meaning: it describes the system in which we have “switched 
off” the interactions between the clusters. In a similar way, we define the Hamilto- 
nian of the cluster C; as 


Ho = Sat SD VulQs- OW): 
EC; j,k EC; 
j<k 
we know from Section 11.1 that Ho = Hc, ®1, where He, is the operator on 
I? (R®™) defined by the right side of the last fenton, Wie MP Dyn, | =" 15253), 
are now regarded as operators on L?(JR°™) ; we sloeniie here by n; the me af 
particles in the cluster C,. 

Let us stress that rather than in the spectrum of the Hamiltonian (14.20) itself, 
we are interested in the spectrum of the operator obtained by separating the center- 
of-mass motion, because only the latter contains nontrivial dynamical information 
about the system. Therefore let H** and Hyg! be the operators obtained after 
separation of the center-of-mass motion from H and Hp, respectively, and let 
Hé&' be the operator originating from the elimination of the center of mass of the 
cluster C; from Ho, (cf. Remark 11.5.1). The operators H'™, Ht act on the 
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space Hye: := L?(IR?N~%) , in which suitable relative coordinates are introduced, 
and H¢! acts on He! := L?(IR°%-3) determined by the internal coordinates of 


the cluster; if n; = 1 the corresponding relative state space is one-dimensional and 
Hee! is a zero operator on it. As above, we denote ae = He! Ql. 

Concluding this argument, we can say that pre) (3n;— 3) = 3N—3n(D) de- 
grees of freedom are associated with the motion within the clusters. The remaining 
3n(D)—3 degrees of freedom correspond to the relative motion of the clusters in 
the center-of-mass frame (since we assume n(D) > 2, this quantity is positive); 
the space L?(IR°*)-3) determined by these coordinates will be denoted as Hp. 
By analogy with Section 11.5 and Problem 11.13, one can prove the following result 
(see the notes). 


14.4.4 Proposition: The decomposition H,.: = Hp ® He Q::-@ HE D) holds 
for any partition D, and the corresponding operator Hf! is expressed as 


ne — Tp +H fees t HE » 


where Tp :=Tp®I and Tp is the operator on Hp describing the kinetic energy 
of the relative motion of the clusters C),...,Cnp(p) in the center-of-mass frame. 


The spectrum of H7%! is given by Theorem 5.7.4. Moreover, choosing the co- 
ordinates suitably (see the notes) one can check that o(Tp) = Rt ; it follows that 
o(H7$') = [tp, 00) where 

n(D) 
in s= >| inf o(Be). 


i=] 


Now we are able to state the mentioned result. 


14.4.5 Theorem (HVZ-theorem): Let H be the Schrédinger operator (14.20) with 
two-particle potentials V;, € (L?+ L°°)(R?). Then os.(H) = [t,0o) , where 
tp = n(D)a2 


t:= tp. 


min 
n(D)>2 


After we have characterized the essential spectrum of Schrédinger operators, we 
have to say something about its subsets, such as the absolutely and singularly con- 
tinuous spectrum or possible embedded eigenvalues. Some results may be derived by 
tools adopted from scattering theory. As an illustration, let us mention an assertion 
which we shall prove in the next chapter (cf. Example 15.3.2 and Remark 15.3.3). 


14.4.6 Proposition: Let H := Ho+ V(Q) be the operator (14.1) on L?*(UR3) 
with a potential V € L?,.(JR?) which has the following property: there are positive 
C, R,€ such that |V(zx)| > Clz|~'~* holds for |z| > R. Then oa,(H) = (0,00). 


Let us now turn to the singularly continuous spectrum. In Section 15.1 we 
shall see that vectors from the corresponding spectral subspace give rise to states 
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with pathological scattering properties; this suggests that we should seek conditions 
under which o,-(H) = 0. Let us first mention the following general result. 


14.4.7 Theorem: Let H be a self-adjoint operator on H. If the condition 
B 
sup jh [Im (, (H—A—ie)“h)PdA < 00 (14.21) 
O<e<l1 Ja 


holds for a bounded interval (a,) C IR, some p> 1, and a vector 7 € H, then 
Ex(a, B)w € Hac(H) . 

Proof employs the Stone formula of Example 5.5.2. We denote J := (a, G) and use 
the fact that any open set G C J can be expressed as an at most countable disjoint 
union Uxer Je of open intervals. We have Ey(J,x) < Exn(J,) for any k € J, and 
therefore 


Dee: a 
(J, Eu(Je¥) < = Jim _ Im (, (H—A—te)"") dd. 
Next we denote J‘) = Uf_, Je and use the Hélder inequality with p-!+q7!=1 


obtaining 


1 . 
(b, Ex(J™)b) < = jim, st Im (a, (H —A~ie)~'p) dd 
1 ; 1/p 
SF dim, (fry | (s (H— Amie) "wy dn) IY", 


where €(-) is the Lebesgue measure. If the index set J is finite, this estimate 
together with the inclusion G Cc J gives the inequality 


(p, En(G)v) < C(J,p) (G)”", 


where the number 
1 1/p 
Omics ( ij Iaith (H-A-ie"'y)P aa) 
wT  e—0+ J ‘ 


is finite by assumption. The same is true for a countable J in view of Proposi- 
tion 5.1.1, the inequality C(J™,p) < C(J,p), and the o-additivity of &(-). To 
prove that the measure (w, Ey(-)W~) is absolutely continuous on J, we have to 
check that (~, Ey(M)w) = 0 holds for any é-zero set M C J. Since the Lebesgue 
measure is regular, for any positive integer j there is an open set G; > M such 
that €(G;) < j~'; hence 


(b, Eu(M)d) < inf (v, En(G;)¥) < C(J,p) inf &(G;)”" = 0 
follows from the above inequality. 


Let us present an illustration of how the proved theorem can be applied to 
single-particle Schrédinger operators. 
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14.4.8 Theorem: Let H := Hy+V(Q) be the operator (14.1) on L?(JR) with a 
potential V € (L'n L?)UR?) and 2m =1; then the condition 


1/2 
Vile = (he as d i) < 4n (14.22) 


implies o,.(H) = 0 

Proof: The operator H is self-adjoint by Theorem 14.1.2 and D(H) = D(Ap). 
The set L'NL? c L¥/? is contained in R (cf. the notes to Sec.14.1), so the integral 
on the right side of (14.22) is finite. We need to estimate the resolvent of H. To 
this end we use the integral-operator representation of |V(Q)|'/2(Ho— z)-!V(Q)!/? 
mentioned in Remark 4.3.4; with the help of Problem 7.18 we easily estimate its 
Hilbert-Schmidt norm, 


I V(Q)P(Ho~ 2)"V(Q)"le < = IVIIn <1 


for any z € (\ JR*. It follows then from Theorem 3.6.2 and Lemma 1.7.2 that the 
operator 1+|V(Q)|!/2(Ho— z)~!V(Q)!/”? is invertible and its inverse has a bound 
independent of z, namely 1—(47)7!||Vilz. Next we use the modification of the 
second resolvent identity from Problem 1.63, which gives 


(H = Zz) = (Ho— ie = (Ho — z)~!V(Q)? 


x [141V(Q)2 Ho 2)2V(Q)!?]" 1V(Q)P? Ho- 2)". 


Consider an arbitrary ¢ € Co°(IR?) and denote f := ||!/2. Since it has a com- 
pact support, the function f belongs to R, and we can check in the same way as 
above that the inequality |/T;(Ho— z)~'Ty|| < Ci := (4m)7"||fllx is valid for the 
corresponding multiplication operator T; and any z € €' \ Rt. It is a little more 
complicated to estimate the mixed products: we have 


|T¢(Ho- z)*V(Q)"" I 


: be fee ae ba f(z)|V Yl + FIV (@)| i, 
|x— yl? IR6 |x — yl? 
ee Te) VE siti + 5 IVR 
2 Jims |e | 


because the function |V(-)|—f(-) is real-valued (cf. Problem 9b). If we denote the 
right side as C?, we obtain {|T;(Ho— z)~1V(Q)'”?|| < Co; in a similar way, we can 
find || |V(Q)|!/2(Ho— z)7!T;|| < C2. Putting the estimates together, we obtain 


7;(Ho— z)“!Tyl| < Cr + CZ (1 - (4m) [IV la) 
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for any z€ €'\ Rt. Since ¢ = T;¢'/?,, where $'/? := ||1/2sgn ¢, it follows that 
(¢, (Ho z)7"¢) = ($'/?, Ty(Ho- z)'Tj6'?) < Ci + C3 (1-(40)""IIV lle) lor’ |)? 


This means that the function (A,€) + (¢,(Ho— A—ie)~'¢) is uniformly bounded 
in {a, 6] x (0,1) for any a, 8 € IR; Theorem 7 then implies Ey(a, 8)? € Hac(H) 
for all a, 6 € RR, i.e., ¢ € Ha-(H). In this way, we have checked the inclusion 
CSIR?) C Hac(H), and since the set on the left side is dense in L?(JR*), we 
conclude that H-(H) = L?UR°). # 


14.4.9 Remarks: (a) In fact, we have proven a stronger result, namely that the 
spectrum of H is purely absolutely continuous. Recall that condition (14.22) 
ensures, due to Theorem 14.3.5, that H has no bound state, i.e., op(H) =9. 
Then the operators H, Ho are unitarily equivalent by means of the wave 
operators 2(H, Ho) , which we shall introduce in the next chapter. 


(b) The singularly continuous spectrum is difficult to investigate because it is 
highly unstable. While 0.3, and oa. are stable with respect to wide classes 
of perturbations (including unbounded operators — see Theorem 5.4.6, The- 
orems 15.2.6-10 and Remark 15.3.3), o,- can be changed by adding a one- 
dimensional projection (see the notes). Nevertheless, rather strong sufficient 
conditions for the absence of the singularly continuous spectrum have been 
proven; in Section 15.3 we shall mention two of them without proof. 


Another question to which we should pay attention is whether o¢s.(H) may con- 
tain embedded eigenvalues. Note first that an eigenvalue may occur at the boundary 
of the essential spectrum in a rather natural way, even if the potential is purely at- 
tractive (Problem 13). It is less trivial that an interior point of o.,,(H) may be an 
eigenvalue. 


14.4.10 Example (Wigner - von Neumann): Consider the Schrédinger operator 
H on L?(R*) with the centrally symmetric potential 
‘ 32 sinr 2 a 
Vs V(x) = — Es (U(r) 14+ S0r)?) cose + (1=3/(7)")sin*r], 
where we set f(r) := 2r — sin(2r). The right side is a smooth bounded function 
of r; hence H is self adjoint. Moreover, the potential behaves at large distances 
as V(x) = 8 sin(2r) + O(r-*), so aess(H) = (0,00) due to Theorem 1. By a 
straightforward computation, we can check that // has a positive eigenvalue, 
r7'sinr 
Tigh = ap, = ——.. 
v= 9, ¥@) = Tap 


The existence of a positive eigenvalue in the example is due to the clever choice 
of the family of potential barriers which prevent the particle escaping to infinity. It 
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is essential that the oscillating potential is slowly decaying, so the following result 
does not apply (see the notes). 


14.4.11 Theorem (Kato): Let H := Hp+V(Q) be the operator (14.1) on L?(UR”) 
with a potential, which satisfies the following conditions: 


(i) V € P+ L™, where p satisfies condition (14.2), 


(ii) there is zero-measure set M C JR” such that IR" \ M is connected and V is 
bounded on each compact subset of R"\ M, 


Gii) lim... |z|V(z) = 0. 
Then H has no positive eigenvalue. 


We shall describe one more method which allows us to exclude the existence 
of positive eigenvalues. It is based on the scaling transformation which we have 
already employed in Theorem 14.3.9. Using the unitary operator Ug: (Uaw)(x) = 
a”/2(ax) , we define V.(Q) := U.V(Q)U;! for a potential V ; it is the operator 
of multiplication by the function z+ V(az). 


14.4.13 Theorem (the virial theorem): Let H := Ho+V(Q) be the operator (14.1) 
on L?(JR") with a potential which satisfies the assumptions of Theorem 14.1.2 or 
Theorem 14.1.4. Suppose further that there is a measurable function f : IR" > R 
such that D(T;) > D(Hpo) , and 


FA V..(Q) x V(Q) 


asl Ci Il 


y= Typ 


holds for any w € D(Ho). Let A be an eigenvalue of H corresponding to an 
eigenvector w ; then 


2(p, Ho) = 200, (A-V(Q))¥) = (Y, Ty). (14.23) 


Proof: H is self-adjoint with D(H) = D(H») by Theorem 14.1.2, and it is easy 
to check that U,H)Uz! = a~?Ho. The last identity together with the assump- 
tion Hy = AW implies (Hp + a?V.)Uaw = rAO7U.W. Since the operators under 
consideration are symmetric, we infer that 


Aa? —1)(Uab,b) = ((Ho+ °Va(Q))Uay,v) — arb, (Hot V(Q))v) 


(a? — 1)(Ua, V(Q)b) — 0° Ua, (V(Q)—Va(Q))¥), 


ve. 


Va(Q)—V(Q) ’) ; 


a—l 


NMa+1) Vath ¥) = (a+1) Vath, V(Q)) + 0? (vw, 
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By Problem 10.12, the dilations form a continuous group on L? UR"), so Ua > w 
as a@ — 1. The limit of the second term on the right side exists by assumption; 
together we obtain 2A||2)||? = 2(v, V(Q)v) + (wv, Ty#), i-e., identity (14.23). I 


14.4.13 Remark: In view of Example 5.9.5b and Problem 10.12, the dilation group 
{Ug : a € (0,00) } is generated by the (closure of the) operator 


On a formal level therefore, the definition of Va(Q) gives Ty = i[D,V(Q)], and 
the right side can be expressed by means of the canonical commutation relations as 
i[D, V(Q)] = Xhe1 Q;(VjV)(Q). Another formal manipulation yields the identity 
i[D, Ho] = —2H, 1.e., 


2(, Hot) = —i(¥,[D, H]¥) + id, [D, V(Q)|v) = [» Eaxv.vier| | 
j=l 


As usual, the danger is hidden in the fact that the set of vectors, on which this 
reasoning makes sense, is too small. Nevertheless, the argument shows that 


" 


f+ f(a) = 2(V;V)@) 
j=! 
is a natural candidate for the role of the function f appearing in the theorem. To 
check that it satisfies the assumption, it is sufficient to find a function g which 
satisfies the condition D(T,) C D(Ho) and majorizes (V.—V)(a@—1)7! in the 
vicinity of the point @ = 1, and to prove the existence of the limit by means of the 
dominated-convergence theorem. This is particularly easy if V is a homogeneous 
function of order —B, i.e., V(ax) = a~*V(z) for all x € R"; then f =—AV. 


14.4.14 Corollary: Let the potential V satisfy the assumptions of Theorem 13 
and at least one of the following conditions: 


(i) V is repulsive, t.e., V(ax) < V(x) holds for all x € R" and a> 1, 
(ii) V is homogencous of order — for some (6 € (0,2). 


Then H has no positive eigenvalue. 


Proof: \f w is an eigenvector, Hy = Ay), relation (14.23) is valid. In case (i) we 
readily check that f < 0, so (W,Typ) < 0; and since (~, How) > 0, it follows 
that (~, How) = 0. However, the spectrum of Ho is purely continuous; hence 
Ker Ho = {0}, t-e., w= 0. In case (ii) we have f = —BV , and therefore 


2x? = 2p, Hoy) +20, V(QW) = 2-B)(, VW) 
= © wt) = Sw, How), 
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where we have repeatedly used the relation (14.23). The coefficient on the right side 
is negative by assumption and (7%, How) > 0 holds for a nonzero w ; hence the last 
identity cannot be validif A>0. 


14.4.15 Example: Let H be the atomic Hamiltonian (14.4). Since Coulomb 
potentials are homogeneous functions of order —1, the corollary implies that H 
has no positive eigenvalues; the same is true for Hamiltonians of molecules, atomic 
and molecular ions, etc. Let us stress that in spite of this result, oes,(H) may 
contain embedded eigenvalues, because its threshold is negative for multiparticle 
systems with attractive potentials, if only they have at least one bound subsystem 
(see the notes). 


14.5 Constrained motion 


Up to now we have considered systems whose configuration space was JR” with the 
dimension given by the number of degrees of freedom; the probability of finding the 
particle(s) in a particular region was determined by the interaction encoded in the 
potential. Now we want to mention the situation when the motion is constrained to 
a subset of the configuration space. 

To motivate the question, recall that if the potential forms a sharply distin- 
guished well and the particle energy is well below its bank, the eigenfunctions decay 
rapidly in the classically forbidden region -—- cf. Example 7.2.5. It is therefore useful 
in some situations to simplify the problem and to assume that certain regions are 
not accessible at all, i.e., that the value of the potential is “infinite” there. This 
is an idealization, of course, because any potential is in fact a sum of elementary 
interactions (Coulomb, spin-orbit, etc.), so even finite-depth rectangular wells do 
not exist in nature. On the other hand, in some cases “sharp” constraints represent 
a reasonable approximation, which is, moreover, often easier to solve than a more 
realistic model. In statistical physics, restriction to a bounded region represents a 
basic ingredient of the theory — see Example 12.3.8. 

The subject again includes numerous problems, and we limit ourselves to the 
presentation of several simple results concerning free motion within an open con- 
nected region G C JR”. First we have to ask about the self—adjointness of the 
corresponding Schrédinger operator. This question is now more complicated, be- 
cause there are different candidates for the role of the free Hamiltonian connected, 
roughly speaking, with the boundary conditions we impose on 0G := bd(G). We 
shall describe one frequently used class. 

Recall that C9°(G) is a dense subset in L?(G), which consists of infinitely 
differentiable functions 7%: G— € whose supports are contained in G ; moreover, 
since G is open by assumption, the distance of suppw to OG is positive. The form 
t defined by 


(6,8) == » ie on PU) te, we ORR), (14.24) 


4 
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is thus densely defined; it is obviously symmetric and below bounded. We also 
introduce the operator Hp with the domain Cg°(IR") by How := —Ayw. Since the 
functions of ¢,w € Co°(IR") and their partial derivatives belong to L?(JR") , and 
the same is true for their extensions to a cube in JR” containing supp ¢ U supp w 
which are zero outside the supports, the Fubini theorem may be applied; integrating 
by parts, we obtain 

(¢, How) = t(d, p) a (Hod, v) 


for any ¢,~ € Co°(R"). This means that the operator Ho is symmetric and the 
form t generated by it is closable due to Proposition 4.6.7. Then there is a unique 
self-adjoint operator associated with the closure ¢, i.e., the Friedrichs extension of 
Ho ; we call this operator the Dirichlet Laplacian referring to the region G, and 
denote it as —A% or H§. To understand where the name comes from, consider a 
few simple cases. 


14.5.1 Examples: (a) Let n = 1 and G := (a,b) C R; then H§ is by Prob- 
lem 4.40b the operator corresponding to the differential expression —d?/dx? 
with Dirichlet boundary conditions at the endpoints of the interval, 7.e., Tp 
in the notation of Problem 4.61. 


(b) Consider next a parallelepiped G := Xf.,J; in IR", where J; := (a;,6,). 
It is sufficient discuss the case n = 2 ; the extension to higher dimensions 
is straightforward. We denote H, := Hj and define H = H, +H. By 
Theorem 5.7.2, this operator is self-adjoint. Moreover, Example 5.7.5 and 
Problem 4.61c show that the spectrum of H is purely discrete, consisting of 
the eigenvalues Aj, := ia) + ia he. jik= 1/2,..., where £Z, = d,—1g;. 


The corresponding eigenfuctions are xj” := ad ® me , where ve are the 


(trigonometric) eigenfunctions of H;. By Example 5.3.lla, the operator H 
is e.s.a. on D := {xjx}un- 


The set C™(G) consists of functions w, which are infinitely differentiable in 
G and have a finite limit at each point of OG ; the latter define the function 
OG + C€, which we denote as ~ OG. Then we can introduce the set 


D := {peEeCc™(G): y\dG=0}; (14.25) 
in the same way as above, we can check that the operator H') : HO := 
—Ay with the domain D(H‘) := D is symmetric, and as a symmetric 


extension of H}D it is e.s.a. Its closure coincides with the corresponding 
Dirichlet Laplacian, i.e, D is a core for H§ ; since H) extends Ho, by 
Theorem 4.6.11 it is sufficient to check that D is contained in Q(H§) := D() 
(Problem 14a). 


(c) The explicit form of the operators also allows us to say something about the 
distribution of the eigenvalues. Let us denote 


NY(G) := dim Ey(—00, A) 
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for H = H§. In the one-dimensional case, the eigenvalues of H§ are 
(1j/€)*, j = 1,2,..., where £:= b—a; hence NP(G) = N holds provided 
N< £va < N+1. This means, in particular, that limy.oN?(G)A7/? = 
£/x. Similar conclusions can be made for n > 2 (Problem 14b). 


In this way, we have in this particular case obtained an expression for the Dirich- 
let Laplacian through an explicitly known operator with a purely discrete spectrum; 
the same can be done for some other simple sets G (Problem 14c). This may not 
be true for a general G, but the essential conclusions of the example remain valid 
provided the boundary is not too wild (see the notes). 


14.5.2 Theorem: Let G be an open connected set with a smooth boundary or a 
polyhedron in R”. 


(a) The subset D..(G) := {~ € D: —Ay € L?(G)} of (14.25) is a core for the 
Dirichlet Laplacian H§. 


(b) In addition, if G is bounded, then D,.(G) = D, the spectrum of H§ is purely 
discrete and Rae We 
fin NPC) _ vavol(G) 


where vy, := 21%? (nr @e)~ and vol(G) is the Lebesgue measure of G. 


If the the region G is unbounded, the continuous spectrum is generally nonempty; 
in some cases even the spectrum is purely continuous. However, caution is needed 
when we draw conclusions about the spectrum from the shape of G. We shall 
present one example which shows once more that an intuition based on our every- 
day “macroscopic” experience can be a false guide when we are dealing with objects 
governed by the laws of quantum theory. 

The system we are going to consider is a free “two-dimensional” particle whose 
motion is confined to a strip S C JR? of width d ; this represents the simplest 
example of a quantum waveguide. Since the mass is not essential in the following, 
we put 2m = 1 and choose the Dirichlet Laplacian Hj? as the Hamiltonian of 
the particle. As long as the strip is straight, it is easy to analyze the spectrum of 
H®: separating the variables we find that o(H ») is purely continuous and equal to 
[x?,00), where «, :=7/d (Problem 15a). Let us see what will happen if we bend 
the strip. 

First we have to characterize a bent strip geometrically. We limit ourselves to 
the situation when the boundary of S is smooth; then we choose one of the strip 
boundaries as a reference curve and denote it as [. The Cartesian coordinates of 
a point (z,y) € I can be described by a pair of functions &€(-), 7(-) ; if we choose 
the arc length s of the curve as the parameter, they must satisfy the condition 
E'(s)?+ 7/(s)? = 1. This allows us to define further the signed curvature of I. by 


4(s) = 1(s)§"(s) — &(s)n"(s) ; 
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this function in turn determines the curve [ uniquely up to Euclidean transforma- 
tions of the plane (Problem 15b). Each point of the curved strip is now characterized 
by its distance u from I measured, of course, on a normal to the curve, and by 
the point s of I at which the normal is taken. The pairs (s,u) define curvilinear 
coordinates on S, through which the points of the strip are expressed as 


z = €(s)—un'(s), y = n(s)+ué"(s). (14.27) 


This defines a map f : IR x (0,d) — S whose Jacobian is easily found to be 
Dy = 1+1u7(s). The map is regular provided the right side is positive. This 
restriction can be easily understood: at the points where y(s) assumes negative 
values, I is the outer boundary of the strip; then the width must not exceed the 
curvature radius, which is |7(s)|~?. 

With these preliminaries, we are ready to discuss the operator H# for the curved 
strip. First we define the map U : L?(S) — L?(JR) @ L?(0,d) by 


(Up)(s,u) = ¥1+uy(s) (vof)(s,u) - (14.28) 


It is unitary by Example 3.3.2, and a straightforward computation (Problem 15c) 
yields the following result. 


14.5.3 Proposition: Let y € Cg°(IR) and infsem(s) > —d~'. Then the operator 
H := UH3U~' is e.s.a. on Doo(JR x (0,d)) and its action there is given by 


2 
Hy = ~ 2 atu? - = + Vy, 
(14.29) 
Waren ¥(s)? uy"'(s) 5 u?y(s)? 


~ ~ a+uy(s))? — 2+ur(s)y? 4 urls) 


Hence we have transformed the problem to the investigation of an operator 
referring to a simpler region (a straight strip); the price we pay for this is the 
more complicated structure of the operator itself. Apart of the weight factor in the 
“longitudinal” part of the kinetic energy, the transformed operator has acquired a 
curvature-induced potential V. This fact has an important consequence. 


14.5.4 Theorem: Let 7 satisfy the assumptions of Proposition 3; then 
(a) Gess(H) = [K?, 00), where x, := 7/d, 


(b) if the strip is curved, y # 0, then there is a positive critical thickness do such 
that // has at least one isolated eigenvalue in [0,x?) provided d < dy. 


Proof: Denote ys := + supyem(+7(s)) ; then y, > y_ > —d7! holds by assump- 
tion. We want to estimate the operator H. To this end we consider the Dirichlet 
Laplacian H, := Tp on L*(0,d) from Example 1a and the Schrédinger operators 


H® = (1+dy4.)~?P? + Vi(Q) 
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on L*(R), where Vs are bounded potentials to be chosen below; the indices 
s,u refer to the coordinate considered. Next we construct the operators Hz := 


* @1+1@ 4H, ; they are e.s.a. on C§° UR) Do(0,d) due to Theorem 5.7.2. 
The last named set is contained in D := DUR x (0,d)), and since Hy are easily 
seen to be symmetric on D, all the three operators have a common core. Let us 
now choose the potentials Vs; in the form 


vs? dl") 5 PY (s)? 


Vals) = — 4(1+dys)2 ~ 2(1+dy_)? ~ 4 (+dy_)4’ 


then 
(b, H_p) < (¥, Hy) < (¥, yy) 

holds for all ~ € D, and therefore for any vector of the common domain of these 
operators. This makes it possible to use the minimax principle. First of all, the 
potentials Vs are bounded and have a compact support, so dess(H‘*’) = [0, 00). 
Then Problem 4.61 and Example 5.7.5 give cess(Hs) = [k?,00) and assertion (a) 
follows from Theorem 14.2.1. 

The same argument will prove (b) if we are able to check that H, has an 
eigenvalue below «? ; for this it is in turn sufficient to show that H@) has a 
negative eigenvalue. Theorem 14.3.6 cannot be applied directly because Vi may 
not be smooth and purely attractive; however, we have mentioned in the notes to 
Sec.14.3 that the result is valid under some regularity conditions (which are satisfied 
here) as long as fp V;(s) ds < 0; this condition is obviously satisfied for all d small 
enough. &f 


The existence of such curvature—induced bound states in strips and similar struc- 
tures can be proved under much more general circumstances. This has interesting 
physical consequences; we comment on these problems briefly in the notes. 


14.6 Point and contact interactions 


It is sometimes useful to assume that the interaction is strongly localized in the sense 
that the particle behaves as free outside some small region of the configuration space. 
If we shrink this region to a zero measure set, then the considerations of the previous 
sections yield trivial results only, because such a potential belongs to the same L? 
class as V = 0. In this section we are going to show that one can nevertheless 
construct a wide class of generalized Schrédinger operators which correspond to 
such situations. 

Let us start with the simplest example. Consider the free—particle Hamiltonian 
on line, Ho := P? of Example 7.2.1, and choose a point y € IR. Let Ay, be the 
restriction of Hg to the subspace {7 € AC?(UR) : u(y) = 0}. In the same way as in 
Section 4.8, we can check that the adjoint operator is again given by the differential 
expression —d?/dx? and its domain is {~: ws € AC?(J*), vy! € L?UR)}, where 
we have denoted Jf := (y,00), Jy := (-oo,y), and ys := wh die The equation 
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(Aj — z)b = 0 then has for any z € C \ IR a unique solution up to a multiplicative 
constant, namely 7(z) := e‘V#l?-¥! with the cut in the square root taken along 
the positive real axis, i.e., the deficiency indices are (1,1). The corresponding 
self-adjoint extensions and their properties can be found easily (Problems 17a-c). 


14.6.1 Proposition: For any a, —0o < a < oo, there is one self-adjoint extension 
Hay of Ay which is the restriction of Aj to 


D(Hay) = { € D(A): Wt) — (y-) = ody) } - (14.30) 


In particular, a = 0 corresponds to the free Hamiltonian Ho, while a = oo 
refers to the Dirichlet boundary condition w~(y) = 0 ; in this case the operator 
decomposes into an orthogonal sum, Hoy = A” ® a , where Af are the 
Dirichlet Laplacians on J? . 


We shall call Hay the 6-interaction Hamiltonian supported by the point 
y and with the interaction strength a. To find a correspondence between it and 
Schrodinger operators, consider a one-parameter family H.(V,y) defined for a given 
V and € € (0,00) by 


Heda = PF? + YO). ae) = : V (=) (14.31) 
The family of the scaled potentials V. “shrinks” as € — 0 preserving the integral; in 
the sense of distributions it converges to a multiple of 6(-—y). It is therefore natural 
to ask whether H,, is in asense a limit of the operators H.(V,y) , so that it can give 
a rigorous meaning to the formal Schrodinger operator — £ +aé(xr—y) . However, 
since we have not introduced convergence of families of unbounded operators, it is 
necessary to replace them first by suitable bounded functions. 


14.6.2 Theorem: Let V € Cp°UR). Anumber z € p(Hay) belongs to p(H-(V, y)) 
for all € smal] enough and the corresponding resolvent converges in the operator- 
norm sense, 


u-lim (H-(V,y) — 2)"' = (Hay — 2)", (14.32) 


where the interaction strength is a@:= fp V(x) dz. 


Proof: The resolvent of Hay can be expressed by means of the Krein formula (Prob- 
lem 17c). On the other hand, the resolvent of H.(V,y) is given by Problem 1.63 
as in the proof of Theorem 14.4.8; using the notation introduced in Section 14.3 we 
can write it as 


(H.(V, y) = i = (Ho = kaye = (Ho—k?)“"V.(Q)? 


x [1 +|V.(Q)|"?(Ho—k?)"V.(Q) 7" |Ve(Q)|!/?(Ho—k?)-) 


where k := ,/z. It is thus sufficient to check that the second term on the right side, 
which we denote as Dx, , converges in the operator norm to the rank-one operator 
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spat (GC, ¥),-) G;(-,y). The operator in question is a composition of integral 


operators with known kernels; changing the integration variable z’ to (x’— y)/e, 
we can rewrite it as Dye = Bre(I+Cre)~' Br, where the involved operators are 
determined by their kernels, 


Bye(zyz') = G,(z,y+ex')V(e')'?7, By e(z, 2’) = |V(x)|'Gr(y+ez, 2’), 
Cre(z,2') = |V(x)|'?Gi(y+exr,yter’) V(a')? 


II 


(Problem 17d). We want to show that they converge to the integral operators with 
the kernels 


G.(z, y) V(2')'? ? B(x, a) = \V(z)|/?Gi(y, 2’) ? 
|V(x)|?Gx(y, y) V (2)? 


BeG@rz') = 
Cx(aya') : 


in the Hilbert-Schmidt norm, and thus a fortiori, with respect to the operator norm. 
Since the function G;(-,-) is continuous, the kernels converge pointwise; then the 
assertion will follow from Theorems 3.6.5 and A.3.7 if we are able to majorize the 
kernel difference by a square integrable function independent of ¢. By assumption, 
there is N such that supp V Cc (—N,/N) ; using the explicit form of G, we readily 
check that 

|Buela,2’) — By(a,2/)/? < ||V]2, e202) 


holds for any z,z’ € JR and € > 0. The convergence for the other two operators 
can be proven in a similar way. 

Hence Dx. converges to the operator De := By(I +C,)~1B, provided —1 is 
not an eigenvalue of C;. Since we know the last. named operator explicitly, we can 
express ([+C;,)~! ; using Problem 1.63 once more we obtain 

a 
1 2a es ee eee 
( ay k) 2k + i(V1/2, |V|12) | | ( ’ ) 
This shows that the condition is satisfied if the denominator is nonzero, i.e., if 
a:= (V'/? |V|)?) 4 2ik. Finally, the above identity together with the explicit form 


of By, Bg yields the sought result, Dy = — (G.(-,9),:) G.(,y). 4H 


14.6.3 Remarks: (a) Since the analytical structure of the resolvent determines 
the spectrum, the proved result suggests that for € small enough the approx- 
imating operator has just one eigenvalue if a := fp, V(x) dx <0, and that it 
approaches —a?/4 as € — 0; this can actually be proven (see the notes). To 
grasp the meaning of this result, note that H.(V,0) is by Problem 17d unitar- 
ily equivalent to the e~? multiple of the operator P?+¢V(Q) , which has for 
small € a single eigenvalue whose asymptotic behavior is — }(ea)? + O(e°) 
as mentioned in the notes to Section 14.3. On the other hand, there is no 
eigenvalue for a > 0, while in the case a = 0 the operator H,(V,y) has a 
negative eigenvalue for small € but it disappears in the essential spectrum in 
the limit. 
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(b) The 6-interactions constructed above do not represent the most general class 
of point interaction for a particle on line. If we restrict Hp to the subspace of 
AC?(IR) , which consists of functions ~ such that ~(0) = y’(0) = 0, then the 
deficiency indices are (2,2) due to Example 4.8.5 and Problem 4.56, and there 
is a four-parameter family of self-adjoint extensions. It includes, of course, 
6-interactions. Another important one-parameter subclass are the so-called 
6'-interactions (supported by a point y and with an interaction strength 
8), which are specified by the boundary conditions 


Vyt) =V(y-) = Vy), vyt+) —v(y-) = BV); (14.33) 


the corresponding self-adjoint operator will be denoted as H;, (cf. Prob- 
lem 18). In contrast to the preceding case, 6’-interactions are not approxi- 
mated by P?+V, but rather by P?+ B., where { B.: € > 0} is a suitable 
family of rank—one operators (see also the notes). 


The proved result thus provides a rigorous background for the heuristic idea of 
the Schrédinger operator with a 6-function potential. Recall that such Hamiltonians 
are typically introduced in order to the describe the behavior of particles which are 
very slow, and therefore the region to which their wave functions are localized in 
the configuration space is large in view of the uncertainty relations. If such an 
“extended” particle interacts with a potential supported by a much smaller set, it 
“sees” it as a point interaction with an averaged interaction strength — this is the 
physical content: of Theorem 2. 

It is natural to ask whether similar point interaction Hamiltonians can be con- 
structed in higher dimensions. Since the method used above was based on con- 
structing self-adjoint extensions to a symmetric operator, obtained by restricting 
the free Hamiltonian to functions, which are zero together with their derivatives at 
the points, which are supposed to support the interaction, it is necessary that such 
a restriction is not e.s.a.; this shows that the construction does not work if n > 4 
(Problem 19a). On the other hand, in the cases n = 2,3 the deficiency indices 
of a one-point restriction are (1,1) (Problem 19b), so there exist one-parameter 
families of point interaction Hamiltonians. 

We shall describe briefly the three-dimensional situation; the case n = 2 which 
can be treated analogously is left to the reader (Problem 20). Without loss of 
generality, we may assume that the point interaction is supported by the origin of 
the coordinates. The starting point is then the partial-wave decomposition (11.20) 
of the free Hamiltonian Hp ; for simplicity, we put again 2m = 1. By Problem 19b, 
the operators he, £> 1, are e.s.a. on CSUR \ {0}) while the deficiency indices of 
ho are (1,1). Self-adjoint extensions hog := T(4ma) of this operator are known 
from Examples 4.9.6 and 7.2.2; they are specified by the boundary condition 


f'(0+) — 4raf(0+) = 0, (14.34) 


where —oo < @ < oo. In the case a = 00, (14.34) is replaced by the Dirichlet 
condition f(0+) = 0, which corresponds to the free Hamiltonian — cf. Problem 7b. 
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14.6.4 Theorem: The operator Ho } CSUR? \ {y}) has for any y € RR? a one- 
parameter family of self-adjoint extensions given by 


Flies, cer las (Fas ® Io ® <p) he ® i) UevP ’ 
£=1 


where U is the operator (11.16) and y- P := Oy Any self-adjoint ex- 
tension of Ho} Cs°(R® \ {y}) coincides with Hy for some a € (—00, 00]. The 
essential spectrum is the same as in the free case, Gess(Hay) = Tac(Hay) = Rt 
while o,.(Hay) = @. If a < 0, there is one simple eigenvalue, namely —(47a)?, 
corresponding to the normalized eigenfunction 


ettalz—yl 


Ya: Yo(x) = Va 


zo) 
otherwise the point spectrum of Ha, is empty. 


Proof: The above argument together with (11.20) proves the first part for y = 0; 
the general result follows from the fact that due to (10.1) and Example 10.2.1b, 
e'vP is the operator of (active) shift on y. Theorem 4.7.14 implies the relation 
Oess(Hay) = IRt , and by Proposition 4.7.12, Hay has at most one eigenvalue; it 
is found by solving the radial Schrédinger equation with the boundary condition 
(14.34). Finally, os-(Hay) = @ follows from the explicit form of the resolvent in 
combination with Theorem 14.4.7 (Problem 21b). & 


The operator Hy, is again called the Hamiltonian with 6—interaction of the 
interaction strength a@ supported by the point y. In asimilar way, one can introduce 
point interaction supported by a finite subset of IR*, or even an infinite subset 
provided it has no accumulation points; these Hamiltonians are again approximated 
by suitably chosen families of Schrédinger operators (see the notes). Hence they 
have the same physical meaning: they model an interaction of slow particles with 
potentials supported by regions much smaller than the wave function spread. This 
mode] simplification brings a significant computational advantage. We have seen, for 
instance, that the spectral problem for a one—center point interaction is explicitly 
solvable: the singularities of the resolvent kernels in Problems 17b and 21b are 
given by simple linear equations. If the point interaction is supported by finitely 
many points, the situation is more complicated, but we still have to solve a single 
transcendental equation instead of a differential one. 

Singular interactions of the type constructed above can be supported not only 
by discrete point sets. For instance, we can easily define Hamiltonians with an 
interaction concentrated at a line or plane in JR? (Problem 23); the difference is 
that now the restriction of Hp to functions vanishing with their derivatives at the 
interaction support leads to a symmetric operator with infinite deficiency indices, 
and the operators in question represent only a “small part” of the vast family of self- 
adjoint extensions. Similar singular interactions can be constructed, however, even 
for manifolds which exhibit no symmetry that would allow us to use a separation 
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of variables (see the notes). One usually speaks in these cases about Hamiltonians 
with contact interactions to emphasize the fact that the particle is influenced 
only if it “hits” a set of lower dimensionality which supports the interaction. 

Contact-interaction Hamiltonians may be again regarded as a low-energy limit 
to a more realistic quantum mechanical description in situations, where the interac- 
tion part of the Hamiltonian is negligible except in the vicinity of a curve, surface, or 
other manifold in the configuration space; they provide a suitable tool for treatment 
of various problems in solid-state physics, quantum chemistry, etc. 

Moreover, the same method can also be applied to situations where the config- 
uration space is a more complicated object than a Euclidean space. As an example, 
consider a nonrelativistic quantum particle on a graph. Such a problem is not a mere 
mathematical excercise. First of all, it has been known for decades that Schrodinger 
operators on some compact graphs provide a rough but surprisingly accurate de- 
scription of the spectra of many hydrocarbon molecules. On the other hand, a new 
physical motivation to study this problem has recently been brought by progress in 
the fabrication of metallic and semiconductor microstructures which we have men- 
tioned in the notes to Section 14.5; graphs represent simple solvable models for many 
of such “quantum wire” devices. 

A crucial point in constructing a Hamiltonian for a particle on a graph is to 
determine what happens at the branching points. We restrict ourselves to the sim- 
plest nontrivial case of a Y-shaped graph consisting of three halflines. The state 
Hilbert space will then be H := L?(JR*+) @ L?(IR*) @ L?(URt). To construct a class 
of admissible Hamiltonians, we start from the operator 


Ho := Ho: ® Ho2 ® Ao; , 


where each Ho; coincides with the operator T on a halfline from Example 4.8.5; in 
view of Problem 4.56 the deficiency indices of Ho are (3,3). Self-adjoint extensions 
of Ho are due to (4.7) parametrized by 3 x 3 unitary matrices U ; they act as 


Hy { 1, be, 3} = {-vi,- 21 3} 


on the domain 
3 3 
D(Hv) := 4 v=6+ Ng (3 +>" und’) : G €C, 6 € D(A) 
j=l k=1 


where the deficiency vectors are easily found from Example 4.8.5, g*) = { f=;0;0}, 
etc. It is again useful to characterize the operators Hy by means of boundary 
conditions. We shall find them for the subset of operators which are invariant with 
respect to permutations, i7.e., 


Pyelly C HuPje,  Pro{di, 2,03} := {d2,d1, 3} , ete. ; (14.35) 


other families of extensions are left to the reader (Problem 24). 
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14.6.5 Theorem: Any Schrédinger operator Hy on the Y-shaped graph, which 
satisfies the requirement (14.35), is given by one and only one of the following 
boundary conditions: 


(a) ¥j(O+) = av; (0+) + dl ( 41) (O+) + Y{;42)(0+)], 9 = 1,2,3, for some numbers 
a,b € IR, where {k} := k — 3[(2k—1)/6], 


(b) (0+) + ¥,(0+) + (0+) = 0 and ¥;(0+) — vj41(0+) = —cfyj(0+) - 
¥}+41(0+)], 7 = 1,2, forsome ce R, 


(c) (0+) = ¥(0+) = (0+) =: ¥/(0) and y,(0+) +2(0+) +43(0+) = dy'(0) 
for some d € IRU {cv} ; the last case corresponds to the Neumann condition 
at each halfline. 


Proof: Under the assumption (14.35), the matrix U is of the form uj, = udj~ + 

v(1—6;.), where u,v € C satisfy the unitarity requirements |u|? + 2|v|? = 

and 2Reuv + |v|? = 0. Substituting a general » € D(Hy) to the boundary 

conditions of part (a), we obtain the relations 1+u = —(e + é€uja — 2€vb and 
= —(€+ €(u+v))b — éva, where €:= e~™/*, which are solved by 


_ —e(ltutvtiutiv+iu?+iuv—2iv’) 7 iJ2v 
a (u+2v—1)(1+iu—iv) ’ ~ (u+2u—i)(1+iu—iv) 


provided the denominator is nonzero; a straightforward computation using unitari- 
ty conditions shows that Ima = Imb=0 and the map (u,v) + (a,b) is injective 
(Problem 24a). Furthermore, we can check directly that the restriction of Hj to 
the set of all ~, which satisfy the boundary condition (a), is self-adjoint (see the 
notes); the remaining cases u = i—2v and u = v-+72 are treated in the same way 
giving 
Se —v 3eu + V2 
= 5", ge - SS, 
3u v 

respectively, provided v is nonzero, while for U = 1] we get the Neumann condition 
on each halfline. § 


14.6.6 Remarks: (a) The boundary conditions (b) and (c) represent limiting cases 
of (a) — cf. Problem 24a. This shows that though the von Neumann theory 
is clumsy, it yields all self-adjoint extensions including those, which might 
be overlooked in the case of multiparameter families, when we construct the 
boundary conditions directly. 


(b) On the other hand, the obtained boundary conditions provide a local description 
of self-adjoint extensions (cf. Problem 24d), and therefore they can also be 
used at points connecting graph links of a finite length. This makes it possible 
to construct admissible Hamiltonians, e.g., for any graph consisting of a finite 
number of finite or infinite links — see Problem 25 and the notes. 
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Notes to Chapter 14 


Section 14.1 Theorem 2 and its extension to N-body systems with Coulomb interactions 
contained in Theorem 4 were proved for the first time in [Ka 3]; see also [Ka], Sec.5.5; 
[RS 2], Sec.X.2. An alternative approach to Schrodinger operators is based on quadratic 
forms: we may define the operator in question as the form sum of Ho and V(Q) (ef. 
Remark 4.6.12). The counterpart of the perturbative argument of Theorem 2 is then 
provided by the KLMN-theorem; the advantage is that this allows us to define Schrédinger 
operators for potentials with stronger local singularities — see, e.g., [Si 1}. 

To illustrate this claim, recall that by Example 3, the result of Theorem 2 applies to 
potentials with local singularities |z—zo|~*, where a < 3/2. Form sums allow us to define 
Schrédinger operators for potentials V € R+L®, where R denotes the set of potentials 
which satisfy the condition 


| WAGIIVON te dy < co: 
re |z—y/? 


these are called the Rollnik class. Since R D> L*/?(IR°) — ef., for example, [Ka 2], Sec.6 
— we see that this approach admits singularities with a < 2. The form-sum method also 
works for potentials of the so-called Kato classes — cf. [|CFKS], Sec.I.2. Moreover, it 
is also applicable in some cases when the form referring to the potential is represented by 
no operator at all; an example is given by point interactions which we discuss in Sec.14.6. 

As we have mentioned, the proof of Theorem 6 requires to check that a sufficiently 
large negative \ is a regular value of H ; this will be ensured, e.g., if (Ho+ V +A)*v =0 
implies w = 0. If V_ #0, we also have to use the result of Problem 4.22, due to which 
Ho + V_ is below bounded. Technically the argument is based on the so-called Kato 
inequality — see, e.g., [RS 2], Sec.X.4; [CFKS], Sec.1.3. 

The self-adjointness problem for Schrédinger operators with a magnetic field was 
discussed for the first time in [IK 1]; an overview of the known results can be found in 
[ We], Sec.10.4; [ RS 2], Secs.X.2,4, and also [CFKS], Chap.6. For instance, one need 
not assume the boundedness of A, and V-A in Theorem 8 (then the result applies to 
the important case of a homogeneous magnetic field) or the smoothness of Ax. 


Section 14.2 The history of the minimax principle is described in [ RS 4], notes to 
Sec.XIII.1. The finite dimensional case is discussed in detail in [ Ka], Sec.I.6. Various 
modifications of Theorem 1 can be found in [DS 2], Sec. XIII.9; a generalization to certain 
sums of eigenvalues is given in [Thi 3], Sec.3.5. 

An extensive and detailed exposition of perturbation theory can be found in the 
classical monograph [ Ka J; the reader may also consult [ RS 4], Chap.XII; [ Thi 3], 
Sec.3.5; | We], Chap.9, etc. The core of analytic perturbation theory is the treatment of 
operator families T := {T(g) € £.(H): g € G}, where G is a domain in the complex 
plane, with suitable analytic properties. 7 is called the Kato-type analytic family if 
P(T(g)) # %,, and for any go € G there is a z € p(T(go)) which belongs to p(T(g)) 
for g of some neighborhood U,, of go, and (T(-)—zo)~! is an analytic operator-valued 
function in U,,. The assertion of Theorem 7 extends to analytic families of this type. 
Proof of Theorem 6 can be found in [ Ka], Thm.1II.6.17; see also [RS 4], Thm.XII.5. 

One of the classes which satisfies the above requirements consists of the families T , 
such that the operators T(g) have acommon domain D independent of g and the vector- 
valued function g++ T(g)w is analytic in G for any ~ € D; such T is called the analytic 
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family of type (A). An example is provided by operators T(g) := Ho+gV , where V is 
Ho—bounded, considered in Theorem 7. Another often used class of analytic perturbations 
is the form counterpart of the type (A) families. 

A result analogous to Theorem 7 can also be proved if the unperturbed eigenvalue is 
degenerate, i.e., its multiplicity is greater than one. In general, however, the perturbation 
removes the degeneracy — the eigenvalue splits and the dependence on the perturbation 
parameter is more complicated; a simple example of such a situation will be given in Theo- 
rem 15.4.4 below. The algebraic derivation of the Rayleigh—Schrédinger series coefficients 
is based on expanding the resolvent (or its projection onto a chosen subspace) in a Laurent 
series; an exhaustive discussion is performed in [ Ka], Chap.II. Finally, let us mention 
that in many cases of practical interest the hypotheses of analytic perturbation theory are 
not satisfied; nevertheless, the perturbation series can be written down and summed in 
some generalized sense — see [Ka], Chap.VIII; [RS 4], Secs.XII.2,3. 


Section 14.8 If the operator (14.1) is understood in the form-sum sense, Theorem 1 may 
be extended to potentials belonging to the family R+L& corresponding to the Rollnick 
class, which is defined as the set of (equivalence classes of) functions with the property 
that for any € > 0 there is a decomposition f = frt+ foo and the L°-component satisfies 
the condition || foollo < €. In addition to (14.12), there are other estimates. For instance, 
the Calogero bound 


(ig eae [ \v(r)|"/2ar 
wT JO 


holds under the assumptions of Theorem 1 provided the function v is nonpositive and 
nondecreasing. These estimates were derived in [Ba 5], [Cal 1], [GMGT 1]; see also 
[RS 4], Sec.XIII.3; the condition for the absence of bound states of Remark 2c appeared 
for the first time in [JoP 1]. 

The Birman-Schwinger principle was formulated independently in the papers [Bir 1] 
and [Schw 1]. Lemma 3 can in the sense of Remark 4 be extended to other potentials, 
for instance, to the class R + L&° in the case n = 3 — see [Si 1], Chap.3. To prove 
Theorem 5 in the general case, we have to approximate V by potentials from C§°(JR°) 
— cf. [RS 4], Sec.XIII.3. The relation (14.15) is called the Ghirardi-Rimini bound. The 
Birman-Schwinger principle can also be used to derive the Bargmann bound — see [Si 5]; 
[ Thi 3], Sec.3.5. Various sufficient conditions for existence of bound states can be found 
in [ChD 1], [CMF 1], [BChK 1]. 

The conclusion of Theorem 6 is valid for a considerably wider class of potentials — 
see, e.g., [Si 6], [BGS 1]. What is important is that the potential need not be purely 
attractive: apart from some regularity conditions, it is sufficient that fV(x)dr <0. It 
is clear from the proof that the existence of the bound state follows from the fact that 
the resolvent in the p-representation, i.e., the function p ++ (p?+ )~! ceases to be 
integrable in the limit x — 0 if nm = 1,2. A more detailed analysis of this singularity 
yields the weak—coupling behavior of the lowest eigenvalue, 


mia) =F (f,Viade) +0") ad mila) ~ ~ exp ie ie viz)de) | 


for n = 1 and n = 2, respectively, provided the integrals are negative. A similar idea 
can be used to modify the Birman—Schwinger bound to obtain bounds on the number of 
bound states in these cases — cf. [Set 1], [K] 1], [New 1]. 
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Theorem 7 was proved in different ways in the papers {Ros 1], [Lie 2] and [Cwi 1]; see 
also [Si 4], Sec.III.9; [RS 4], Sec.XIII.3. A distinguishing property of the bound (14.17) 
is its correct behavior for large values of the coupling constant. Under the assumptions of 
the theorem, the quasiclassical relation 


pn/a _ al i V_(2)I"/2 de 
jim, go N(QV) = ch ey (x)| 


is valid — see, e.g., [Lie 3] and [RS 4], Thm.XIII.80. The results collected in Remark 8 
are discussed in [LiT 1], [GGM 1], [Lie 3]; see also [Si 4], Sec.III.9. A more detailed 
analysis of the number of bound states for potentials having the critical decay at infinity 
can be found in [St 1]. 

The discrete spectrum of hydrogen-type atoms is probably discussed in every quantum 
mechanics textbook; a detailed treatment of Schrédinger operators describing atoms with 
one or two electrons can be found in [ Thi 3], Secs.4.1-4.3. Theorem 11 was proved in 
[Ka 4], where the helium atom with a finite nucleus mass M was also considered. In 
that case the center-of-mass Hamiltonian contains the Hughes—Eckart term —M7'P, - Po 
(cf. Problem 11.13a); T. Kato demonstrated that the number of bound states is not less 
than 370(no + 1)(2no + 1), where no is the greatest integer satisfying the inequality 
ng < (1—Z-!)?M/m. It follows that the real helium atom has at least 25585 bound 
states; a more sophisticated estimate due to | Zi 1] shows that the discrete spectrum 
of the neutral-atom Hamiltonian is again infinite. Let us remark that a more realistic 
treatment of atomic and molecular Hamiltonians requires electron spins to be taken into 
account even if the interaction is purely Coulomb; the reason is that the electrons obey the 
Pauli principle, and therefore we are in fact interested in the spectrum of the appropriate 
Schrédinger operator restricted to the subspace with a prescribed permutation symmetry. 
A review of this problem is given in [ AZS]; see also [ RS 4], notes to Sec.XIII.3. 

The mentioned conclusion about the infiniteness of og(H) for atomic and molecular 
systems may not be valid for negative ions. The simplest example is the H~ ion which 
was shown in [Hil 1] to have just one bound state; this illustrates at the same time that 
Theorem 11 is not valid for y = 2. More generally, there are upper estimates on the 
maximal number N(Z) of electrons which a nucleus of charge Ze can bind; in particular, 
it has been shown that large atoms are asymptotically neutral, i.e., N(Z)/Z — 1 as 
Z — co —~'see; e-g., | Rus 1], [lie], (LSST 1], [ELS 1]; | CPKS}; Chap.3.” Similar 
properties have been demonstrated for diatomic molecules -- [Rus 2], [Sol 1]; for other 
properties of ions see, e.g., [Rus 3], [SSS 1]. 

Theorem 12 can be proven for a substantially wider class of Schrédinger operators in- 
cluding Hamiltonians of N particles interacting via two-particle potentials Vin €-Raak> , 
Schrédinger operators in L?(JR") with potentials V € L2,-UR") satisfying the condition 
lim)z}+00 V(Z) = 00, etc. - see {RS 4], Sec.XII1.12; [We], Sec.10.5. 

One of the important questions for Schrédinger operators is finding lower bounds for 
the ground-state eigenvalue. In some cases such estimates are of fundamental physical 
importance. Consider, for instance, a system of atoms consisting of N electrons and 
K nuclei described by the appropriate Hamiltonian H which involves their Coulomb 
interactions, and denote Ey,x := info(H). The problem of stability of matter means 
to determine whether the binding energy of such a system is an extensive quantity, i.e., 
whether Ey,~ > —C(N+K) holds for some constant C. This inequality was proved 
by A. Lenard and F.J. Dyson; E.H. Lieb and W. Thirring managed to obtain a realistic 
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estimate of the constant C — cf. the reviews [Lie 5,6] and [ Thi 3], Sec.4.3. It 
is crucial for the proof that electrons are fermions, because the inequality holds only on 
the “antisymmetrized” subspace in the domain of the corresponding Schrédinger operator. 
The ground state energy Ey in asystem of N charged bosons fulfils, on the contrary, the 
inequality Ey <—DN*/> — see [Co 1], [CLY 1]. The stability of atoms and molecules 
interacting with a magnetic field, i.e., the boundedness from below of the spectrum in 
dependence on the field intensity has been studied in [FLL 1], [LL 1]. 


Section 14.4 In addition to the mentioned extension of Theorem 1 to V € I? 4+ L© 
in the n-dimensional case, the same result holds for p = n/2 if n > 5, as well as for 
V ¢ R+L@ in the case n = 3 — see [RS 4], Thm.XIII.15. The assumption about the 
boundedness of the potential from below in Theorem 2 can also be weakened; it is sufficient 
that the negative part V_ € L?+L™ , where p satisfies condition (14.2). Stronger negative 
singularities of the potential are admissible provided operator H is defined as a form sum 
— ef. [RS 4], Thm.XIII.69. 

Theorem 3 appeared for the first time in [Mol 1]; see also [GI], Sec.28; if the operator 
H is defined as a form sum, it is sufficient to assume V € L}.(R). In the original paper 
A. Moléanov proved a similar criterion for the n-dimensional case; however, the latter is 
of little practical importance because the interval (a,a +6) is not replaced by a cube in 
IR” but by a more complicated set. 

The described transformation of an N-particle Hamiltonian for a given clustering of 
the particles is usually referred to as cluster decomposition; to obtain a better idea about 
it we recommend the reader to work out in detail a simple nontrivial case (Problem 12). 
The proof of Proposition 14.4.4 and the positivity of Tp uses the so-called cluster Jacobi 
coordinates, i.e., the Jacobi coordinates for the 3n(D)-—3 degrees of freedom corresponding 
to the motion of cluster barycenters, completed by arbitrary 3N —3n(D) coordinates 
describing the motion within the clusters. The operator Tp is then unitarily equivalent 
to the Hamiltonian of n(D)—1 free particles — a more detailed discussion can be found 
in [RS 3], Sec.X1.5. 

Theorem 5 is named after G.M .Zislin, C. van Winter and W. Hunziker, who proved it 
for different classes of potentials; for Schrodinger operators defined as form sums its validity 
can be extended to potentials Vj, € R+L2° — see [Si 1], Sec. VII.3 or [RS 4], Sec.XIIL.5. 
Since in physically interesting cases some of the particles involved are usually identical, one 
has to study the restriction of H to the appropriate permutation-symmetry subspace; a 
detailed discussion for atomic systems including the interaction with an external potential 
is given in [JW]. 

Proposition 6 follows from scattering theory which we shall discuss in the next chapter. 
It is amusing that the same method can also be used to prove the absence of an absolutely 
continuous spectrum for a class of one-dimensional Schrodinger operators with a family 
of growing or broadening potential barriers to both sides of a given point — cf. [SSp 1]. 

Theorem 8 and the unitary equivalence mentioned in Remark 9a appeared for the first 
time in [Ka 2]; see also [RS 4], Thm.XIII.21; their validity extends to an arbitrary poten- 
tial V € R provided H is defined as a form sum. Examples showing that the singularly 
continuous spectrum may be unstable with respect to finite-dimensional perturbations 
can be found in [RS 4], Sec.XIIL6. 

The idea of Example 10 was formulated by E. Wigner and J. von Neumann in 1930; 
it is also possible to construct examples of potentials which produce any prescribed finite 
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set of positive eigenvalues — cf. [Al 1]. Theorem 11 was proved in [Ka 5]; it represents 
a particular case of the Kato-Agmon-Simon theorem, which also admits potentials with 
a slow decay at infinity provided they do not oscillate — see, e.g., [RS 4], Thm.XIII.58. 
The formal virial theorem mentioned in Remark 14 was introduced in quantum mechanics 
by B. Finkelstein in 1928; the proof of Theorem 13 was given for the first time in [We 1]; 
see also [ Al 1], [Si 4]. 

The existence of embedded eigenvalues for multiparticle systems with bound subsys- 
tems is quite natural. Consider, for instance, operator (14.18), whose point spectrum for 
y = 0 is o,(H,) = {=nj7? —njz? : 21, ne yd...) ; cimeenoess(Ha): =-(—1neo) ky 
Theorem 5, the eigenvalues with n,,n2 > 2 are embedded in the essential spectrum. If 
we switch in the electron repulsion, i.e., take -y # 0, a part of these eigenvalues turn into 
resonances as we mentioned in the notes to Sec.9.6. 

In addition to the interactions considered here, there are other important classes 
such as periodic potentials. Consider the simplest situation, where H := Ho+ V(Q) is 
the Schrédinger operator on L?(JR) corresponding to a bounded measurable function V 
which satisfies the condition V(x+6) = V(x) for some 6 > 0. The operator H can then 
be shown to be unitarily equivalent to the direct integral (27)7! So-2n)(To+ V(Q)) dd, 
where Ty is defined in Example 4.9.1 and V is the restriction of the potential to the 
interval (0,b) denoted by the same symbol; this decomposition is usually named Floquet 
by mathematicians and Bloch by solid-state physicists. 

As in the case of a direct sum, the spectrum of the direct integral is a union of the 
component spectra, o(H) = Usejo,on) 7(Tet+ V(Q)) . If V =0, the eigenvalue ;(¥) runs 
by Example 4.9.1 through the interval [(27k/b)?, (27(k+1)/b)”) , so we obtain o(H) = IRt 
as expected. On the other hand, if V # 0 the intervals covered by the eigenvalues may 
leave gaps ; the spectrum consists in this case of a finite or infinite number of bands. A 
discussion of periodic potentials with a literature guide can be found in [RS 4], Sec. XIII.16. 
If a periodic Schrédinger operator is perturbed by a localized potential, eigenvalues may 
appear in the gaps — see, e.g., {ADH 1]. The importance of such operators stems from 
the fact that they can be used to model crystal impurities. 


Section 14.5 The definition of the Dirichlet Laplacian again conventionally includes the 
minus sign. Using quadratic forms we can also define other operators describing a free mo- 
tion in the region G,, in particular, the Neumann Laplacian which generalizes the operator 
Tn of Problem 4.61. As we have stressed, different “boundary conditions” correspond to 
different choices of the dynamics of the problem. Theorem 2 holds for a wider class of 
regions than those considered here — cf. [RS 4], Secs.XII1.14 and XIII.15. The relation 
(14.26) was derived by H. Weyl in 1911. In the quantum -mechanical context, it expresses 
the well-known principle that in a quasiclassical regime when there are many eigenvalues, 
their number equals roughly the phase space volume occupied by the corresponding clas- 
sical system, divided by (27)" (or (27h)" in the standard system of units). Using this, 
we can derive the asymptotic relation for N(gV) mentioned above; notice that c? of 
Remark 14.3.8 is nothing else than (27)~"vu,. Estimates of the asymptotic behavior for 
sets with a less regular boundary are given in [ vdB 1]. 

In addition to the asymptotics (14.26), the distribution of eigenvalues of H§ for 
regions of different shapes offers many interesting problems. For instance, various bounds 
can be derived for ratios and distances of the eigenvalues — see, e.g., { ASHS 1], [AB 1,2], 
[AE 1]. An inverse problem for Dirichlet and Neumann Laplacians is to reconstruct the 
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shape of G from the knowledge of the spectrum; this is content of the M. Kac’s famous 
question whether one can hear the shape of the drum — for a recent negative result see 
[GWW 1]. Still another problem is represented by the distribution of the eigenvalue 
spacings in billiard-type regions; a type of this distribution determines whether such a 
system is integrable or chaotic — cf. the notes to Sec.8.3. 


The investigation of Schrodinger operators describing particles in tubes, layers, and 
more complicated constrained structures has recently gained a strong impetus from the 
technological progress that has made it possible to produce various tiny semiconductor 
(or metallic) structures of a very pure material — see, e.g., [Sak 1], [TBV 1], [Bar 1], 
[GoJ 1], [VOK 1], [WS 1], and references given in these papers, and also [ ES 2] 
for an overview of related mathematical problems. Proposition 3 and Theorem 4 were 
proved in [ES 1] under weaker assumptions than those used here. Instead of y € CS, 
it is sufficient that y is twice continuously differentiable and 7, 7’, 7’ are bounded; the 
isolated eigenvalues then exist provided the curvature decay is O(|s|~!—*) as |s| — 00 
or even slower; the same is true for a bent circular tube in IR? — cf. [Ex 7]. Let us 
also remark that the mentioned regularity requirements concern the proofs (based on the 
transformation (14.28)) rather than the essence of the effect; by other methods one can 
check the existence of bound states in infinite regions with nonsmooth boundaries — see, 
e.g., [ESS 1], [SRW 1]. 

In practice, a “quantum wire” is always coupled to macroscopic leads; in a such situ- 
ation the curvature-induced bound states produce resonance scattering — cf. [Ex 8] 
— which have observable consequences in conductivity properties — see, e.g., [TBV 1}. 
Resonances of a different type, which are also due to the potential (14.29), occur in in- 
finitely long tubes at energies close to the transverse-mode thresholds KG, i) ee, Wists 
effect is discussed in [DuES 1] and represents another example of embedded-eigenvalue 
perturbation theory. Curvature effects also exist in layers. For instance, if a sufficiently 
thin smooth layer has a straight edge, low-energy particles move along it — ef. [ESS 2] 
and Problem 16. 


Section 14.6 Point interactions regarded as formal 6-function potentials were introduced 
into quantum mechanics by R. Kronig, W.G. Penney, E. Fermi, and others in the thirties; 
rigorous interpretation using self-adjoint extensions is due to [BF 1]. Later they became 
an object of intensive study, in the first place by the group of S. Albeverio and R. Hgegh-— 
Krohn; the results are summarized in the monograph [| AGHH], where a rich bibliography 
can also be found; see also the proceedings [ES 1,2], [DE], [EN], and [AFHL]. Physical 
aspects of point interactions are discussed in [DO]. 

Theorem 2 is valid for any V € L'!(JR) provided the approximating operators are 
understood in the quadratic form sense — cf. [AGHH], Chap.I.3, where one can also find 
the proof of the claim contained in Remark 3a. 6-interactions can also be approximated by 
Schrédinger operators with scaled short-range potentials in dimension two and three, even 
in the situation when they are supported by an infinite set without accumulation points; 
the difference is that the coupling constant requires in these cases a renormalization, 1.e., 
other than natural scaling as e — 0 — see [AGHH]. The approximation of the class of 6’— 
interactions introduced in Remark 3b by P?+ 9(€) Eye), where the Ey.) are projections 
to one-dimensional subspaces spanned by vectors w(e) that “shrink” as « — 0 and the 
coupling constant g(e) diverges in the limit, is discussed in [Seb 3]. In a fixed interval 
of energy, the 6’-interactions can alternatively approximated by Schrédinger operators 
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on suitable graphs — see [AEL 1]. Some of the “remaining” point interactions in one 
dimension have been treated, e.g., in [Seb 4] and [ChH 1}. 

There are other examples of how we can use a symmetry of a submanifold in the con- 
figuration space to construct a contact-interaction Hamiltonia; let us mention interactions 
supported by a sphere [AGS 1], a family of concentric spheres [Sha 1], etc. Other meth- 
ods make it possible to define and analyze contact interactions on sets with no symmetry, 
and also to obtain wider classes of Hamiltonians in symmetric cases; loosely speaking, the 
coupling constant is allowed to vary along the interaction support — see, e.9., [ Te 1], 
{BT 1], [BEKS 1]. Let us stress that such constructions also work for “wild” sets such 
as curves which are continuous but nowhere differentiable — cf. [Che 1}. 

The idea of modeling organic molecules by electrons living on the molecule “skeleton” 
was formulated by L. Pauling in 1936; for a review see (ES 5]. The proof that each 
of the boundary conditions of Theorem 5 defines a self~adjoint operator is similar to 
Theorem 4.9.2; it can be found in [ES 6]. Graphs with loops exhibit interesting transport 
properties; in particular, if they are placed into an electric or magnetic field — see, e.g., 
{Bu 1], [WWU 1], [ARZ 1], {ESS 3], [AAAS 1]. Scattering on more complicated graphs 
was discussed in [GP 1]. 

The method of self-adjoint extensions also allows us to construct Schrédinger-type 
operators on configuration spaces composed of sets of a different dimensionality such as a 
plane and a halfline, two planes connected through a point, etc.; the results can be used to 
model the so-called point-contact spectroscopy experiments — cf. {ES 3-5]. In a similar 
way, one can model the Helmholz resonator, i.e., a cavity with a small opening [Pop 1], a 
point interaction with an internal structure [Pav 1], etc.; another model of this type will 
be discussed in Section 15.4. 


In addition to Schrédinger operators, there are other important classes of quantum me- 
chanical Hamiltonians. One of them consists of Dirac operators 


H :=a-P+Bm+V(Q) 


on L*(R3;€4) describing a relativistic particle of spin 3. The matrix coefficients are 
A; = ( ’ rm iE where oj; are the Pauli matrices, and @ := 3 - ye: In general, 
the potential V is a matrix-valued function, but usually it of the form of a function 
V multiplied either by the 4 x 4 unit matrix (in which case we abuse the terminology 
and speak about a function) or by @. Among physically most interesting are naturally 
potentials such that V is a sum of Coulomb potentials. Dirac operators can be defined 
also in IR", n # 3 ; the dimension of the matrices, i.e., the number of wave function 
components depends on the dimension n of the configuration space. 

The present status of the theory of Dirac operators is described in detail in the recent 
monograph [ Tha]; here we limit ourselves to a few basic results. In distinction to the 
Schrodinger case, the self—-adjointness may here depend on the value of a coupling constant. 
For instance, a Dirac operator with the Coulomb potential v(z) = y|z|~! is e.s.a. on 
C§ UIR* \ {0};€*) provided || < 3/2; if we express y in the standard system of units, 
this corresponds to the condition Z < 118, which is satisfied for all really existing nuclei. 
This fact was noted already in [Rel 1]; a simple proof of the essential self-adjointness for 
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ly| < 1/2 can be found in [ We], Sec.10.6. On the other hand, if the condition is violated 
there is a distinguished self-adjoint extension of such a Dirac operator — the only one 
which exhibits finite mean values of the kinetic and potential energy; these conclusions 
extend to potentials with several Coulomb-type singularities — cf. [Nen 3], [Kla 3], 
[Kar 1]. It is interesting that if the Dirac particle has an anomalous magnetic moment, 


i.e., the potential V is replaced by V— #&17-VV, where 7; := ( ne, MA ) , then the 
Dirac operator with V(x) = 7|2|~! is e.s.a. on Cg°(IR?;C*) for any |y| — see [Beh 1], 
[GST 1]. 

Under the above mentioned conditions, too, the essential spectrum of the Dirac ope- 
rator is (—oo,—m]U[m,oo) as in the free case; in the standard system of units m is 
replaced by the rest energy mc?. Since H is below unbounded, we lose some efficient 
tools of the Schrodinger theory such as the quadratic form method, the minimax theorem, 
etc. The discrete spectrum is contained in the gap; some results concerning this can be 
found, e.g., in {K1 2], [Gr 1]. In comparison to Schrédinger operators, Dirac theory has 
very few exactly solvable models. One of them, which is known from quantum mechanical 
textbooks, concerns a single-center Coulomb potential (relativistic hydrogen atom); a 
generalization to an arbitrary dimension is given in [Wo 1]. 

We have mentioned that the mass parameter m in the Dirac operator is in fact 
mc/h. By a natural consistency requirement, the operator H —(mc/h)I has, as c > co, 
in some sense to approach the corresponding Schrédinger (or Pauli) operator describing 
the nonrelativistic spin 5 particle interacting with the considered potential. There are 
numerous papers treating the nonrelativistic limit of the Dirac theory; as an example, let 
us quote [GGT 1,2]. 

Since Dirac operators are first-order differential operators, they are e.s.a. when re- 
stricted to a set of functions vanishing at a set of codimension greater than one — cf. 
[Sve 1] — and therefore point interactions analogous to Theorem 14.6.4 and Problem 20 
cannot be constructed. On the other hand, room is open for contact interactions suppor- 
ted by subsets of codimension one, say, by a sphere in JR® ; such Dirac operators have 
been constructed in [DES 1,2]. Point interactions exist in dimension one — see [GS 1]; 
Dirac operators on graphs have been discussed in [BT 1]. 

Still another class of Hamiltonians consists of operators of H := /P?+m? +V(Q), 
which may be used to describe relativistic spinless particles in a fixed reference frame. 
In distinction to the Dirac case, these operators are below bounded and many spectral 
properties of Schrédinger theory can be extended to them — see, e.g., [DL 1], [Lie 7], 


[LY 1], [Nar 1]. 


Problems 


1, Let (L?+ L&)(iR") be the set of (classes of) functions f : IR" — € with the 
following property: for any € > O there is a decomposition f = fpe + foo,e such 
that fe El, foe EL™, and || foo,elloo < €- 


(a) the Coulomb potential V : V(z) = y|2|~', belongs to (L?+ L2°)(UR%), 


(b) more generally, what local singularities and decay at infinity are allowed if a 
function belongs to (L’+ L&°)(UR") ? 
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2, Let Ho := P? be the free Hamiltonian on L*(JR"). Show that the inequality 
He! wll < I(SHo + (48)-")¥l| < Bil Hovll + (48)~"IIv|| is valid for any 6 > 0 and 
an arbitrary 7 € D(Ho). 

3. Let D, and D,_1 be subspaces of dimension n and n—1, respectively, in a Hilbert 
space; then there is a nonzero vector wp € Pagel Pre 
Hint: DnaOD2_, = {0} would mean that the projection of a nonzero ¢ € Dewto 
Dr-1 is nonzero; choose linearly independent ¢1,...,¢n € Dn- 


4. Find examples of Hermitean operators B on an infinite-dimensional 7 such that 


(a) oess(B) consists of a single point \ and the parts of the discrete spectrum, 
which we denote as oa(B), in both the intervals (—oo, A) and (A,0o) are 
nonempty, 


(b) oess(B) is a two-point set {A}, A2} and og(B) is contained in (Aj, A2). 
Hint: (a) Consider T; of Example 4.1.4 with a suitable alternating sequence. 
5. Check the relations of Remark 14.2.2c. 


6. Using functional calculus, prove Theorem 14.2.6 for a self-adjoint T ; show that the 
projection P is orthogonal in this case. 


7. Let ve L©URt). Prove: 


(a) the operator h: (hf)(r) = —f"(r) + (Br-? + yr7! + v(r)) f(r) on L?URt) 
is e.s.a. on C§°(0,0o) for any ye R if BES 3 ; in the opposite case its 
deficiency indices are (1,1), 

(b) let H := Ho+V(Q) be the Schrédinger operator on L?(JR*) with the centrally 
symmetric potential, V(z) = u(r). Check that the s-wave operator ho : 
hof =—f" + vf satisfies the Dirichlet boundary condition at the origin, i.e., 
D(ho) = { f € AC?(0,00) : f(0) =0}. 


Hint: (a) Use Theorem 4.8.7. (b) If f € D(ho), then U~'(f¥o0) given by (11.16) 
must belong to the domain of H. 


8. Using Theorem 14.3.1, find upper bounds to the number N)(V) := dim Ey (—00, 4) 
of bound states with energies smaller than .. 
Hint: Put V_(A, x) := min{V(xz)—., 0} and use the minimax principle. 


9. Prove: (a) If n = 1,2, we can choose ¢ € L?(IR") so that (14.16) is valid. 
(b) fre 42AY dx dy < 0 holds for a real function f € (L'N L?)(IRS). 


Hint: (a) The expression in question equals fyzn |x(p)|?(p?+ «2)~'dp, where x := 
F,|V(Q)|'/2@ ; choose ¢ € Cs with supp¢? > suppV. As for (b), replace the 
denominator by |x—y|?+ e? and use Problem 7.17b. 


10. Fill details into the proof of Theorem 14.3.9. 
(a) Show that (Y, Hw) > (1—4c) (v, Hew) holds for any w € D(Ho). 
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(b) If f € L?+ L© has a compact support, it belongs to L? for p € [1,2]. 


11. Let ~ € L?(JR3) be rotationally invariant, (x) = W(r) for some w € L?(IR*, r2dr) ; 
then 


[alta tWe)Par = fl min{rt, Jal} [w@)Pae 
holds for any vector a € R°. 


12. Perform the cluster decomposition for a system of three particles. Prove Proposi- 
tion 14.4.4 in this case, and check that if D consists of n clusters, the operator Tp 
is unitarily equivalent to P? on L?(IR3"—)), 


.13. Let H := Ho+V(Q) be the operator (14.1) on L?(JR?) with a centrally symmetric 
potential, V(x) = v(r) for some v: R+ > R. 


(a) Find an example of a purely attractive V such that H has a zero-energy bound 
state, i.e., there is wy € L?(IR?) such that Hy =0. 

(b) Check the result discussed in Example 14.4.10. 

Hint: (a) Consider an eigenspace of L? with @> 2 for a rectangular well. 
14. Using the notation of Section 14.5, prove 

(a) the inclusion D Cc Q(H§) holds in Example 14.5.1b, i.e., for any w € D there 
is a sequence {Wm} C C§°(G) such that wm — y and Awvm — 8, in L(G), 

(b) the relation (14.26) for a parallelepiped, 

(c) using the partial-wave decomposition, check that the conclusions of Exam- 


ple 14.5.1 extend to the ball G:={z: |z| << R} in R”. 


Hint: (a) Let ro be the center of G, and {a} an increasing sequence with 
Qm +1. A given we D can be approximated by functions, which are convolutions 
of ~(z0+ Am(-—2Zo)) with 67,j(Bm:) for a suitably chosen sequence {6}, where 
3 € CSUR") with fi(2)dr = 1. (b) The volume of an n-dimensional ellipsoid 
with semi-axes $1,...,8n 18 Un [Jj 8;- 


15. Suppose that S Cc IR? is a strip of a width d as described in Sec.14.5, and H# is 
the corresponding Dirichlet Laplacian. Denote x; := 1j/d, j =1,2,.... 


(a) Let the strip be straight, S = { (x,y): 0<y<d}. Show that H3 is unitarily 
equivalent to the operator @j2,(P?+«?) on L?(S) = @%, LR). What is 
the multiplicity of the spectrum in the ea (Ki, Kj oy) ? 


(b) The quantity (so, 51) := fe, y(s) ds represents the bending of the curve, i.e., 
the angle between the femaent vectors taken at s; and sq. Points of the curve 
I are then expressed by 


g(s) = &(s0) + f "cos B(S0, 81) ds1, (8) = n(so) + i "sin B(S0, 81) ds, 


for a fixed so. 
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(c) Prove Proposition 14.5.3. 

16. Suppose that S is a curved strip of a width d, which satisfies the assumptions 
of Theorem 14.5.4, and consider the Dirichlet Laplacian H if corresponding to the 
curved layer L:= Sx R. 

(a) Find a unitary operator analogous to (14.28) which transforms H into an 
operator on L?(0,d) @ L?(IR?). 

(b) Check that the spectrum of H & is purely continuous and equal to [o, 00) , 
where Xp := inf o( Hj). 

(c) Assume that d is small enough so that Ay < «2. Write the action of the unitary 


propagator associated with H& on vectors from the range of the spectral 
projection Ey1 (—Ao, K?) . 


17. Consider the 6—interaction for a particle on line of Section 14.6. 


(a) Prove Proposition 14.6.1. 


(b) Given k with Imk > 0, denote by G, the free resolvent kernel of Problem 7.9, 
G.(2,2') = K_ux(2,2') = & etklz-z'l If k # —ia/2, the resolvent (Hay —k?) 
is an integral operator with the kernel 


2ak 
G,(2,2') — = mm Gi (2, y)Gr(2',y) - 


oo, while for a € (—oo,0) there is ex- 
actly one eigenvalue — } a? 
J/—2a etlt-wl, 


(c) Geas( Hey) =-Cee(Hag) =JR* and qeliieg) =O. Thespointespacimumiof 
Hay is empty if a > 0 or a 
corresponding to the eigenfunction We: Ve(r) = 


(d) Fill the details into the proof of Theorem 14.6.2. Check that up to an overall 
scaling factor, the approximating operator is transformed by the dilation group 
of Example 5.9.5b into 


€’Ua(e)He(V, y)Ua(—e) = P? + ee7*P¥/*V(Q). 
(ce) Let Y = {a 
CT, -. 


j = 1,...,N} be a finite subset of the real axis. To any 
., Qn € (—00, oo] construct the point interaction Hamiltonian supported 
by Y with the é6-interaction of strength a, at the point y,. Generalize to 
this case assertions (b), (c), and Theorem 14.6.2. 

Hint: (b) Use Theorem 4.7.15. (¢) Use Theorem 14.4.7. 


18. Consider the 6’ interaction specified by the boundary conditions (14.33). 


(a) Find the spectrum of H By and an integral-operator expression for its resolvent. 


(b) Extend the results to a 6’-interaction supported by a finite subset of R. 
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(c) Generalize the conclusions to the point interaction specified by the boundary 


conditions 
Wt) -V'u-) = 5+) +¥0-)) + JWWH+V-)), 
(yt) -vy-) = -7 wo) + W(y-)) + E wit) tv) 


with a,8 € IR and yEeC. 


19. Let Ho be the free Schrodinger operator on L?(JR") and denote by Hp its restriction 
to C§° UR" \ {0}). Using the partial-wave decomposition, prove 


(a) Ao is e.s.a. if n > 4; this conclusion remains valid if the origin of the 
coordinates is replaced by any point y € JR", 


(b) in the cases n = 2,3 the partial-wave components of Ho with a nonzero & are 
e.s.a. while the s-wave part of the operator has the deficiency indices (1,1). 


Hint: Use Problem 7a and Remark 11.5.3. 


20. Construct single-center point—interaction Hamiltonians in L?(JR?). Show that in 
contrast to the one—- and three-dimensional cases, such an operator has always a 
negative eigenvalue unless it is trivial. 


21. Consider the point-interaction Hamiltonian Ha, of Theorem 14.6.4. 


(a) Prove that the resolvent (Ha,y — k”)~! is for any k 4 —4ria with Imk > 0 
an integral operator with the kernel 


ie, eee 
G,(2, 2 ) ae aa 4k G,(2, y)G(xr y) ’ 


where G; denotes the free resolvent kernel of Problem 7.19b, i.e., Gx(x, 2’) = 
ik|z—y| 
K_»(2,2/) = Fo - 
(b) Use this result to show that os-(Ha,y) is empty. 
22. The Kronig-Penney model describes equidistantly spaced 6—interactions of the same 
strength a on line. In other words, there is L > 0 such that the corresponding 
Hamiltonian Ha,y acts as Hay = —y” in each interval Jn := (nL,nL+L), 


and its domain consists of all functions w such that wWhJ, € AG?) and the 
boundary conditions 


W(nLb+) = Y(nL—) = (nL),  Y'(nL+) —y'(nL—) = av(nL) 
are valid at each point of Y:={nL:neZ}. 


(a) Find the spectrum of Ha,y and show how the band and gap widths behave 
for large energies. 

(b) How does the spectrum change if we replace 6 by 6’, or more generally, by 
the point interaction of Problem 18c? 


Hint: Use the Floquet-Bloch decomposition described in the notes to Sec.14.4. 
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23. Let L, P bealine and a plane in RR, respectively, and Ho the free Hamiltonian 
on L?(R®). Using a separation of variables, 


(a) show that the deficiency indices of Ho} C§°(IR?\ L) and Ho} C§°UR?\ P) are 
infinite, 


(b) construct Hamiltonians with a contact interaction supported by L and P, 
respectively, invariant with respect to translation along the manifold. 


24. Consider the operators Hy on the Y—-shaped graph of Section 14.6. 


(a) Fill the details into the proof of Proposition 14.6.5. Show that the boundary 
conditions (b) and (c) represent the limiting cases of (a) for a,b —- too with 
c:=b—a and d:= 3(a + 2b), respectively, preserved. 

(b) Check that any Hy with the continuous wave function, ~)(0+) = w2(0+) = 
~3(0+) =: w(0), corresponds to a one-parameter subclass of the boundary 
conditions from Proposition 14.6.5a, (0+) + ~5(0+) + ¥4(0+) = cw(0) for 
some c € IRU{co}, where the last named case refers to the Dirichlet boundary 
condition at each halfline. 

(c) Find the family of boundary conditions determining Hy with the wave func- 
tions continuous between a pair of the halflines only, e.g., ~,(0+) = w2(0+). 

(d) Show that the boundary conditions, which determine a particular extension, 
mean in fact the conservation of probability current through the junction, 
DL, Im H,(0+)v}(0+) = 0. 

(e) Extend Proposition 14.6.5 to the case of n halflines connected at one point. 


25. Under the assumption of wave function continuity, find the admissible Hamiltonians 
for a particle on 


(a) a T shaped graph, i.e., two halflines and a line segment connected at one point, 
(b) a loop with two halfline leads. 


Chapter 15 


Scattering theory 


15.1 Basic notions 


The problem of scattering of certain objects (particles, waves) on an obstacle (target) 
can be found in classical mechanics and field theory as well as in quantum theory. In 
the last case it is particularly important because intentionally prepared collisions of 
particles (nuclei, atoms, etc.) represent one of the very few efficient ways of studying 
their structure. 

Recall first some basic concepts. Scattering may be a natural process such as 
penetration of cosmic rays through the Earth’s atmosphere, part of an artificially 
prepared system like the cooling of neutrons in an atomic reactor, or an experiment 
in which we collide a beam of certain objects (for brevity we shall speak mostly about 
particles) produced by an accelerator with other particles which belong either to a 
fixed target or to another particle beam. Many scattering processes involve three 
or more particles some of which may be clustered (e.g., into atomic nuclei). The 
scattering is said to be elastic if the number of particles, their sorts, and clustering 
are the same before the collision as after; otherwise we speak about an inelastic 
process. These possibilities are not mutually exclusive; in most cases the same 
projectile and target can collide both elastically and inelastically; a simultaneous 
description of all scattering processes is called multichannel scattering. Another 
classification concerns the number of collisions involved: if it is only one we speak 
about a simple scattering; otherwise we have a multiple scattering process. 

A common feature of all scattering situations is that at the initial and final stage, 
when the projectile and target are far apart, their interaction can be neglected and 
the time evolution is governed by a “free” Hamiltonian Ho rather than the full 
Hamiltonian of the system. For instance, if two spinless particles which interact via 
a potential are scattered, then the full Hamiltonian is the corresponding Schrédinger 
operator, while the free Hamiltonian is the Schrodinger operator with zero potential; 
one can easily find other examples. Hence a scattering system is characterized by a 
pair of self-adjoint operators H, Ho to which the propagators 


Wer ues Walt) tee" 
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correspond; we again call them the full and free propagator, respectively. 

Now we want to compare the two dynamics. First of all, we have to give a 
precise meaning to the heuristic claim that for very large positive or negative times 
the system behaves as it would be free. It is obvious that this is not true generally: 
if the projectile and the target are in a bound state described by an eigenvector of 
H , then they stay in this state forever. Motivated by this observation we associate 
with H aset M,(H) C H whose elements describe scattering states. We have to 
specify, of course, the properties that such states are supposed to have. This will be 
done below; for the present moment we assume only that 


(sl) M,(H) is U(t)-invariant for all te R. 


This requirement is physically natural, expressing the time homogenity: the set of 
scattering states certainly should not depend on the instant when we decide to 
perform the experiment. Denoting the corresponding projection by E,(H), we can 
write the assumption (sl) in the form [E,(H),U(t)] =0 forall te R. 

Using scattering states we can express the basic idea of scattering theory as the 
asymptotic conditions: we assume that for any state y% € M,(H) there are vectors 
ws € M,(Ho) such that 


jim, [|U(t)d — Volt)pall = 0; (15.1) 


the states associated with them are called asymptotic states; these allow us to express 
the “initial” and “final” state as Uo(+t)y= , respectively, for large negative t. Using 
the unitarity of the propagators, we may rewrite (15.1) as 


we = lim Un(t)'U(t)Y, Y= lim U(t)Uolt)vs ; 


the first of these relations shows that the maps w +> wx are injective. 

On the other hand, the ranges of these maps should be the whole set M,(Ho) ; 
in particular, an “initial” state which belongs to the set of scattering states of the 
free Hamiltonian should evolve into a scattering state of the system. In other words, 
we suppose that for any ~_ € M,(Ho) there is a vector w € M,(H) such that 
(15.1) holds, and the same for 4 € M,(Ho). In view of the above relations, this is 
equivalent to the assumption of existence of the operators 


lf = s- lim U(t)*Uo(t)E,( Ho) . (15.2) 


These are called the wave operators; we also denote them as 24(H, Ho) when we 
want to stress the pair of operators to which they correspond. 

By definition, the wave operators map into the set of scattering states of the full 
Hamiltonian, RanQu C M,(H). It may happen, however, that one or both of the 
asymptotic conditions are not fulfilled for some y € M,(H), i.e, RanQy # M,(H). 
An example of the situation, where the “positive” asymptotic condition is violated, 
is provided by a process in which the scattered particle is eventually captured by 
the target and never leaves the interaction region. 
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We are particularly interested in the case when the wave operators satisfy the 
asymmetric relation 
RanQ_ c RanQ, ; (15.3) 


then we have the following scheme of a scattering event: for any w_ € M,(Ho) 
the vector-valued function Up{-)w_ describes a state of the system in the distant 
past. The map ~_ + w := N_y_ € M,(H) defines a vector describing the state at 
t = 0, to which the initial state has evolved by means of the full propagator U(-). 
Since ~ belongs by assumption to RanQ,, there is a vector ~, € M,(Ho) such 
that y = Oy, ; in view of (15.1) we have U(t) = Up(t)y4 as t + +oo. Speaking 
figuratively, we may say that the vectors #4 determine “free asymptotes” to the 
trajectory U(-)y of the state vector. 

The scattering experiment now consists of comparing in the distant future the 
state Uo(t)w4 with Uo(t)d for different ¢ € M,(Hp), i.e., with states whose evolu- 
tion was governed all the time by the free propagator. The corresponding transition— 
probability amplitude (Uo(t)d, Uo(t)w+) = (¢,~+) can be expressed through the 
original vector ~_ because assumption (15.3) implies the existence of the operator 


S:= M0, (15.4) 


which connects the asymptotic states, ww, = Sw_ ; it is called the scattering 
operator (or S-operator, S—matrix). 

To make use of these definitions we have to know more about scattering states, 
in particular, to give meaning to the above described intuitive distinction between 
bound and scattering states. Intuition leans on properties of simple quantum me- 
chanical systems consisting of one or several particles, where the probability of find- 
ing the system within bounded spatial regions is clearly essential. Consider therefore 
a family { M, : r > 0} of subsets in the configuration space C’ which characterize 
the localization; we suppose that they are fully ordered by inclusion and such that 
U->0 M, = C. For instance, if the scattering system consists of just two particles, 
we can choose M, := IR® x B, , where the B, are concentric balls of radius r in the 
center-of-mass frame. Furthermore, denote by Fi. the projection to the subspace 
of functions with supports in M, ; then we have 


s-lim F, = jp) 
r—00 
A natural requirement on scattering states is that the probability of finding the 


system within a fixed bounded region vanishes as t — too. Since the latter equals 
\|F-e*# "||? | we define 


M(t) ie {¥ EH: ee Fre#tyy = 0 forall r> o} : (15.5) 
On the other hand, bound states have just the opposite property: the probability 


of finding the system outside a sufficiently large region remains small for all times. 
Hence we put 


M(H) := {v eH: lim, sup ||( — Fel] = o} (15.6) 
te 
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Using the definitions we can derive the elementary properties of these sets. 


15.1.1 Proposition: The sets of scattering and bound states are mutually ortho- 
gonal subspaces in # and 


M(H) > 1,(H), M(H) C H-(H), 


where H,(H) is as usual the closed subspace spanned by the eigenvectors of H and 
H(A) = Hp(H)* = Hac(H) ® Hsc(H) 

Proof: Using the inequality {| + ||? < 2||||?+ 2\|@||?_ we check that M,(H) and 
M,(H) are subspaces in H. Let {yn} C Ms(H) be a sequence converging to some 
yw €H. The operators F, and U(t) have unit norm, so 


FUG)? < hv — dll? + AAU (t)ynll?. 


By assumption, for any € > 0 there is an n such that |\~ — dnl? < §, and since 
Yn € M,(H), we can find a to so that ||F,U(t)~,ll? < § holds for all t with 
|t} > to. Then ||F,U(t)||? <€,so ~ € M,(H) and the subspace M,(H) is closed. 
On the other hand, choosing {%,} C M,(H), we deduce in a similar way from the 
inequality 


sup ||(I — F,)U (tll? < 2\lb — Yall? + 2 sup || — F)U(t)dall? 
teR teR 


that ~ € M,(H), i.e., the subspace M,(H) is closed too. Consider now arbitrary 
vectors y € M,(H) and ¢ € M,(H) ; then we have 


I(d,9)? = \(U)e,U¢)P 
= |(FU(t)v,U()d) + (U(t)Y, (I - F)U(t)9)P 
< 26l7IAUOwI? + 27 — FU dll? 
< 


2 
2 (GI? FU (e)vll? + 2 lI? (sup I(r — FU wl) 


After the limit t — too, the first term on the right side disappears and the other 
limit r — oo also annuls the second term, so (¢,~) = 0, i.e, M,(H) 1 M(H). 
Finally, assume that w is an eigenvector of H, Hw = Aw ; then we have 


sup ||(I — Fre"? = || - Fv? + 0 
teR 
as r—+ 00, so W belongs to My(H) , and the same is true for the subspace H,(H) 


spanned by all the eigenvectors. Combining this result with the orthogonality of the 
subspaces, we get 


M,(H) C M(H)* Cc H,(H)* = H.(H), 
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so the second inclusion is also valid. [§ 


Next we are going to give an abstract sufficient condition under which the above 
inclusions turn to identities. 


15.1.2 Proposition: Let o,.(H) = @ and denote P, := Ey(—oo,a] for any 
a> 0. If the operators FP, are compact for all weaneee r,a, then 


M(H) = 7,(H), M,(4#) = H.(H) = H.-(H). (15.7) 


Proof: By assumption, H.(H) = H,,(H). Choosing a vector ~ from this subspace 
and an arbitrary € > 0), in the same way as above we obtain the estimate 


|F-U (vl)? < 2 — Pa)bll? + 2I|F-PaU (é)e/?. 


Due to properties of the spectral measure, the first term on the right side is < 5 for 
a large enough. Next we use Problem la, which gives limo F,-PaU(t)y = 0; 
hence there is a to such that the second term is also < § if [t| > to. It follows 
that ~ € M,(H), so the second of the relations (15.7) is proved; combining it with 
the preceding proposition, we find H,(H) C M4(H) C M,(H)+*=4,(H). & 


We want to know, of course, whether the identities of Proposition 2 are valid for 
concrete scattering systems. We restrict ourselves here to the simplest case of two 
spinless particles interacting via a potential; references to more general results are 
given in the notes. After separating the center-of-mass motion, our problem reduces 
to the scattering of a particle with the reduced mass on a potential V: R? — R; 
the free Hamiltonian is Hp of Example 7.5.8 while H is (the closure of) Ho+V(Q). 

We begin with an auxiliary result concerning operators of the form f(Q)g(P) 
and g(P)f(Q). If g € L’(UR’), their action, in view of Problem 7.17, can be 
written down explicitly as a composition of an integral operator and multiplication 
by a function. The operators f(Q), g(P) are generally unbounded; however, it may 
happen that the product is densely defined and bounded on its domain; following the 
usual convention, in such a case we employ the ie f(Q)g(P) and g(P)f(Q) 
for the corresponding continuous extension to L?(UR 


15.1.3 Proposition: (a) Suppose that f,g € L?(JR*) and the operators f(Q)g(P) , 
g(P)f(Q) are densely defined; then they are Hilbert-Schmidt and 


NF(Q)9(P)Il < WF(Q)g(P)ll2 = (2r)-*? IF Hall ; 
the analogous relation is valid for g(P)f(Q). 


(b) The operators F.(Ho — z)~' and (Ho — z)~!F, are compact (even Hilbert- 
Schmidt) for any z € p(Ho), r > 0, and 


M,(Ho) = Hac(Ho) = L’(R*). 
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Proof: By Problem 7.17, f(Q)9(P) acts on its domain as an integral operator with 
the kernel (x,y) + K(x, y) := (2m)~°/? f(x)(Fs'9)(y — x). It is straightforward to 
compute its Hilbert-Schmidt norm: using a simple substitution and unitarity of the 
FP-operator, we obtain the relation 


i |K (x,y) [da dy = (2r)-* || f\l? Iigll? 


which implies the above inequality. In a similar way, we can check that g(P) f(Q) 
with the kernel (x,y) ++ (2m)~°/? f(y)(Fy'g)(y—z) is Hilbert-Schmidt; this proves 
assertion (a). The operators F, and (Hy—z)~! are bounded, and the function yr, 
belongs to L?(JR°) as well as g: g(k) = |k? — z|~! (Problem 2); hence (b) follows 
from (a) and Proposition 2 in combination with Example 7.5.8. 


By part (b) of the just proved result, any state is a scattering state of the free 
Hamiltonian. On the other hand, the set M,(H) of the full Hamiltonian depends, 
of course, on the chosen potential. We have, for instance, the following result. 


15.1.4 Theorem: Let H = Hy + V(Q) with the potential V € (L? + L®)(IR®) 
and 0,-(H) = 0; then the relations (15.7) are valid. 

Proof: We know from Section 14.1 that D(H) = D(Ho) C D(V) ; hence the second 
resolvent identity gives 


(H — 2)"'F, = (Ho ~ 2)"\F, — (H — z)"'Va(Q)(Ho — 2)" Fr 


~ (H — 2)"'Voo(Q)(Ho — 2)7"F, 


for any z € p(H)Np(Ho). In view of Proposition 3b, the operators V2(Q)(Ho—z)7! 
and (Ho — z)~'F, with an arbitrary r > 0 are Hilbert-Schmidt. Furthermore, 
V.. € L®, so the corresponding operator V..(Q) is bounded, and the operators 
(H — z)~' and F, are also bounded. Hence (H — z)~'F, is Hilbert-Schmidt, and 
the same is true for the adjoint operator F,(H — z)~' ; the result then follows from 
Problem 1b. § 


15.1.5 Remark: It is essential in the above proof that the singularly continuous 
spectrum of H was supposed to be empty; this allowed us to use the Riemann- 
Lebesgue lemma in the proof of Proposition 2. If os:(H) #@ there may exist states 
orthogonal to both the sets My(H) and M,(H). In a sense, they are “between” 
the bound and scattering states: the probability of finding the system within a 
fixed bound region may not have a zero limit as |t| + +oo, but its mean value is 
arbitrarily small if taken over a sufficiently long time interval (see the notes). 


The identification of the bound states and scattering states with the pure point 
and absolutely continuous subspaces, respectively, of the full Hamiltonian, which we 
have made here for the system of two Schrédinger particles, can be also proved for 
other scattering systems. With this fact in mind, in scattering theory we usually 
replace the physically justified definition (15.5) of scattering states by the assumption 
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(s2) M.(H) = Hac(H). 


This is clearly consistent with (s1); the advantage is that now the set M,(H) de- 
pends automatically on the spectral properties of operator H only. 

Now we are able to formulate two basic problems of scattering theory. The first 
of them concerns existence of the wave operators for a given pair H, Ho ; if we adopt 
assumption (s2), definition (15.2) is reformulated as 


2s (H, Ho) #= g-lim U(E)"Uo(t) Pae(Ho) (15.8) 


We have mentioned that the scattering process may be asymmetric with respect to 
the direction of time, in particular, that the S-operator exists under condition (15.3). 
If we require at the same time that an “initial” asymptotic state %_ corresponds 
to each “final” ~,, then the opposite inclusion must hold too (cf. Problem 4a). 
Finally, if we add the assumption that the validity of the asymptotic condition is 
not restricted, z.e., a pair w_, ~, corresponds to any scattering state w~ (recall 
that ws are always mapped into H,.(H) — see Problem 3b) we obtain 


Ran 24(H, Ho) = RanQ_(H, Ho) = Hec(H). (15.9) 


If this condition is satisfied the wave operators are said to be complete. This is one 
of two important properties we try to check for any scattering system; the other is 
the absence of the pathological states mentioned in Remark 5. The wave operators 
are asymptotically complete if they are complete and o.-(H) = @. Proof of 
asymptotic completeness represents the second basic problem of scattering theory, 
which is usually more complicated than the first. 

Of course, a rigorous scattering theory is not restricted to proving existence and 
asymptotic completeness of the wave operators. One also has to deduce the relations 
between the S—matrix and the cross section, which represents the true observable 
quantity, to give meaning to the stationary scattering theory, e.g., by proving the 
direct integral decomposition S = f® S(\) dA where S(A) is the on-shell S—matrix 
of the stationary theory; furthermore, one has to prove the dispersion relations, to 
introduce the Born series and check its convergence, to investigate scattering in a 
centrally symmetric potential (phase analysis in partial waves, Jost solutions, analyt- 
ical properties of the scattering amplitude, etc.); we suppose the reader has already 
encountered a formal version of these results in quantum mechanical textbooks. 

One should also introduce multichannel scattering formalism, allowing us to de- 
scribe scattering of three or more particles (or two particles with an internal struc- 
ture), to modify the definition of wave operators and other results for long-range 
potentials and for scattering on a time-dependent target, and also to investigate 
specific features of scattering in quantum field theory. However, such a program 
would require a separate book and we can only refer to the sources mentioned in 
the notes (see also Section 15.4). 
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15.2 Existence of wave operators 


Our aim is now to discuss some of the problems formulated at the end of the previous 
section, i.e., to find some sufficient conditions for the existence and completeness 
of wave operators. We start with several simple assertions which are valid in any 
scattering system if only the operators (15.8) exist. 


15.2.1 Proposition: (a) The wave operators 94(H,Ho) are partial isometries 
with the initial subspace Hac( Ho) . 


(b) The intertwining relations are valid: 


U(t)03.(H, Hy) = 04.(H, Ho)Uo(t) ; te R, 
03(H, Ho)Ho C HO4(H, Ho). 


Proof: The wave operators are bounded by definition. We have Hac(Ho)+ C Ker xz, 
and on the other hand, ||U(t)*Uo(t)Pac(Ho)w|| = ||z|| holds for any % € Hac( Ho) 
and t € IR. Since Ns = N(H, Ho) are supposed to exist, the left side tends to 
[Qs.~|| as t + too ; this proves (a). Consider next a fixed t € R ; then 


Q4 = slim U(s + t)"Uo(s + t)Pac(Ho) = U(t)" islim U(s)"Uo(s) Pac( Ho) Uo(t) ; 


because U(t), Uo(t) are bounded, the group {U(t)} is commutative and Uo(t) 
commutes with P,.(Ho) ; the first intertwining relation then follows from the uni- 
tarity of operator U(t). Now let w be an arbitrary vector from D(Hpo) ; then the 
boundedness of 2 implies 


Ud(t) — 1 -I 
|. ao w + iMsHoy|| < “yt w + iHow 


Due to the Stone theorem, the right side tends to zero as t — 0, so the function 
0,Uo(-)w is differentiable at t = 0 and its derivative equals —1Q2How. It follows 
from the already proven identity that 


sim Foy = —inatey, 


and therefore using the Stone theorem once again, we obtain Q47) € D(H) and 
AQsy=QLHoy. OF 
Similar intertwining relations hold for the adjoint operators 24, (Problem 5). 


15.2.2 Proposition (chain rule): Let H, H,, and Ho be self-adjoint. If 0.(H, H1) 
and Q4(H1, Ho) exist; then the same is true for the wave operators 24(H,Ho) and 


Q.(H, Ho) = OCH, H;) 04(A;, Ho) : 
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Proof: Denote U;(t) := e~*#1*. We have Ran 24(Hi, Ho) C Hac(H1) , and therefore 
gm (1—Pae(H,))U%()Uo(t)Pae(o) = (I—Pae( Hh) Ga(H, Ho) = 0. 


Next we use the decomposition 


U*(t)Uo(t) Pac(Ho) = U*(t)U1(t) Pac( H1)Ui (t)*Uo(t) Pac( Ho) 
+ U*()Ui(E) (I — Pac(H1))Ui(t)"Uo(t) Pac Ho) ; 


where the second term vanishes in the limit t — -too in view of the above rela- 
tion. By the sequential continuity of operator multiplication, the first term gives 
(HH, Hy) (Ai, Ho). | 


We shall also prove a simple completeness criterion. 


15.2.3 Proposition: The wave operators Q4(H, Ho) are complete iff 04.( Ho, H) 
exist. 

Proof: If Q4(H, Ho) are complete, P, = P,.(H) holds for the projections Px. 
onto the subspaces RanQx, and Problem 5 implies 01(Ho,H) = 04(H, Ho)*. 
Conversely, if Q4(Ho, H) exist, then the chain rule gives 04.(H, Ho) 4(Ho, H) = 
Q4(H, H) = P,.(H) ; hence Ran0s.(H, Ho) D H.-(H), and the opposite inclusion 
follows from Problem 3b. § 


After this introduction we want to derive some conditions which ensure the 
existence of wave operators. 


15.2.4 Theorem (Cook criterion): Let H, Ho be self-adjoint, and assume that 
there is a set D C D(Ho) NM Hec( Ho) , which is dense in Ha-(Ho) and such that for 
any w € D there is T, >0 with the following properties: 


(i) Uo(t)p € D(H) for |t| > Ty, 

(ii) the vector-valued functions (H — Ho)Uo(-)y are continuous on any compact 
subinterval of R\ (—Ty, Ty), 

(iii) frp |(H— Ho) Uo(£t)p|| dt < 00. 


Then the wave operators 24(H, Ho) exist. 

Proof: Denote w(t) := U(t)*Uo(t)w for ~ € D. In view of Problem 6a, the function 
w(-) is continuously differentiable in (Ty,0o) with w(t) = iU(t)*(H — Ho)Uo(t)y. 
By Proposition A.5.1, we have 


vs) - ver) s [Iw @liae = [|| H—Ho)Uoleyw tat. 


for any s > t > Ty. Furthermore, using the absolute continuity of the integral in 
combination with assumption (iii), we see that ||7(s)—wW(r)|| — 0 as r,s > 00, 
t.e., the limit 

lim U(t)"Uo(t) Pae( Ho) 
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exists for all y € D. It also exists for ~ € Hac(Ho)* (being equal to zero), and 
therefore for each w € (DU Hac(Ho)*)iin. Due to the assumption, we can find ~ 
from this set for any ¢ € H,€ > 0 in such a way that |\y—@l| < § ; using the 
standard trick we check that 


||(U(s)*Uo(s) — U(r)*Uo(r)) Pac( Ho)? || < € 


for all r,s large enough, i.e., that Q, exists. The existence of Q_ is proved in the 
same way. § 


Let us remark that it is sometimes not difficult to check assumption (ii) of the 
theorem (Problem 7). It is often useful to introduce the generalized wave operators 
corresponding to a given B € B(H) ; they are defined by 


0.4(H, Ho; B) := s-lim U(t)* BUo(t) Pac( Ho) - 
As an illustration, we prove the following modification of the previous theorem. 


15.2.5 Theorem (Kupsch-Sandhas): Let H, Ho be self-adjoint and F € B(H). 
Suppose that the assumptions of Theorem 4 are valid with the exception of (iii), 
which is replaced by 


Z ” |\(HB—BHo)Uo(+t)w)|| dt < 00, 


where B := I—F. If F(Ho— z)~® is compact for some z € p(H), @ > 0, and 
D Cc D((Ho— z)®), the wave operators Q4(H, Ho) exist. 

Proof: As in the previous theorem, we can prove the existence of 04(H, Ho; B) ; 
hence it is sufficient to check that 04.(H, Ho; F’) = 0, which is equivalent to 


‘lim, [|FUo(t)¥| = 0 


for any ~ € Hac(Ho). We choose first % € D ; then we have ||FUo(t) Pac(Ho)|| = 
| F’(Ho — z)~8Uo(t)(Ho— z)®)|| , where we have used the fact that functions of Ho 
commute mutually, together with the inclusion D C H,.(Ho). By assumption, 
F(Ho — z)~® is compact, so the condition is satisfied for any ~ € D in view of 
Problem 1a; the proof is completed by the density argument. 


Next we are going to formulate several existence conditions based, rougly speak- 
ing, on trace-class properties of the interaction Hamiltonian. They can be derived 
from the following fundamental theorem. 


15.2.6 Theorem (Pearson): Let H, Ho be self-adjoint and B € B(H). Suppose 
that there is a trace-class operator C' such that (Hd, Bw) — (¢, BHow) = (¢, Cy) 


for all ¢ € D(H), w € D(Ho) ; then the generalized wave operators Q3.(H, Ho; B) 
exist. 


First we shall derive some general properties of the absolutely continuous spec- 
tral subspace Ha-(A) for an arbitrary A € L4.. Recall that the Radon—-Nikodym 
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theorem associates with each ~ € Ha-(A) a unique fy € L'(JR) , which is positive 
a.e. in JR and satisfies the relation 


y(M) = |Ea(Myvl? =f folt)at 


for any Borel M C R, in particular, |||? = || full. 


15.2.7 Lemma: The set M(A) := {7 € Hac(A) : Il fulleo < 00} has the following 
properties: 


(a) M(A) is a dense subspace in H,-(A), 
(b) Sr \(¢,e*“*p)|?dt < 27 |I9 I?ll fulloo holds for any y € M(A) and gE H. 


Proof: The fact that M(A) is a subspace in H,-(A) follows from the inequality 
Have S 2\a2Hy + 2g, which implies ||faysolles < 2la?llfylloo + 2llfelleo for any 
w~,¢€ M(A) and a € €. Next we shall construct to an arbitrary 7 € Ha-(A) a 
sequence {tn} C M(A) such that ¥, — 7. The corresponding vector f, € L' (JR) 
can be represented by a Borel function which is everywhere positive and finite; 
with an abuse of notation we shall employ the symbol f, again. The Borel sets 
Sp:={teR: f,(t) >n} obviously form a nonincreasing family and (1%, Sn = @. 
We set , := (I—E4(S,))7 ; then 


wy,(M) = | (1—xs,(#)) dun(t) = fxs, (0) falt) ae, 


which shows that %, € Hac(A) and the corresponding function is fy, := (l-vs,) fr 3 
it follows from the construction that || fy, lo <n, s0 %n € M(A). Finally, the 
relation s-limp_..of4(Sn) = E4(0) = 0 implies %, — 7, and consequently, M(A) 
is a dense subspace in H,,(A). 

To prove (b) we introduce the measure Vyg(-) := (%, Fa(-)d). Using the Schwarz 
inequality together with the fact that the projection P,.(A) onto H-(A) commutes 
with A, we find 

lYyp(M)|? < py(M)in(M), 
where 7 := P,,(A)¢. This means that vyg(-) is absolutely continuous with respect 
to the Lebesgue measure, and therefore there is a unique gyg € L'(JR) such that 
Vpo(M) = fue Gue(t) dt holds for all M € B. It follows then from Proposition A.4.5 


that 
(dye) = fe dvya() = fe gue(A) dA. 


Hence we need to check that gyg € L?(UR) , in which case the last relation may be 
rewritten as (2),e7*4°¢) = V2 (Fgyg)(t) and the unitarity implies 


[ Wer*9) Pat = 2m lowell 


The function vyg[-] := vys((—00,:)) is absolutely continuous in R, so $ vyg[t] = 
gue(t) with the possible exception of a set Nyg of Lebesgue measure zero. The 
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same is true for fy = Vyy ; then using the above mentioned consequence of the 
Schwarz inequality for the interval M with the endpoints t and t+h, we obtain 


Yyuglt+h] — walt] 
h 
for all t € IR\ (Nye U Nyy U Nan) and h #0. Since y € M(A), the limit h — 0 


yields [guo(t)? < fult)fa(t) < lifulloofa(t). We have f, € L'(IR) by assumption, 
and therefore gyg € L?(IR) with 


llgvell? < [felloollfalls < Wfulleollall? < Wfulloll#ll? 
where we have used 7 = P,,(A)@; this concludes the proof. § 


Proof of Theorem 6: Denote 2(t) := U(t)*BUo(t) and 4, := O(t)-—Q(s). We shall 
prove the existence of 0,(H, Ho; B) ; the argument for Q_(H, Ho; B) is analogous. 
As in the proof of Theorem 4, it is sufficient to show that 


< F (uylt-+h] — wlel) £ alt] — alt) 


‘lim, [[,.nl] = 0 


holds for all 7 of some dense subspace; we choose for it the set M(Hpo) intro- 
duced above. First we shall find to any ¢ € H an integral representation of the 
vector %,@. Using Problem 6b, we readily check that the function (p,2(-)w) is 
differentiable for all ~ € D(H), ~ € D(Ho) and 


£ (W209) = £ (BUY, Uoldd) = (W.U()"CU(t)¥).. 


The derivative is clearly continuous, so the function (w,(-)q) is absolutely con- 
tinuous; then we have 


(,%8) = (ds (2@) ~ 3) 8) = + f° (,U)*CU( Va) dr 


for all s,t € IR. Since the vector-valued function t + U(t)*CUo(t)¢@ is continuous 
for any @ € H, the Bochner integral J:s(¢) := ff U(r)*CUo(r) ddr exists; the 
map J;,(-) is linear and ||J:5(@)|| < |t—<s| ||C'l] ||@|| follows from Proposition A.5.1, 
so Jt, € B(H). Moreover, Proposition A.5.2 gives (b,.%, 50) = i(y, Jis(W)) for all 
we D(H) and y € D(Ho). Thus ,, = iJj;, and the sought integral expression is 
of the form 


2.6 = i f "Glribdr, Clr) := U(r)*CUd(r). 


Due to Problems 6b and 8b, the function 7 +> ||Q4,47@||? is continuously diffe- 
rentiable for any ¢ € H and 


d 
Wt" (T) = FIMernerrO|l? = 2Re (Q4nsard, (Clt+7)-C(s+7))¢) 


2 Re (%,,sUo(r)¢, (C(t)—C(s))Uo(r)4) . 
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Since the function on the left side is continuous, we have 


IQerarrGl? — Isl? = f° wl2(r) dr. 


By Problem 8a, the operator %, is compact for any s,t € IR, and therefore 
lim, 50 Qe¢pnsarl = limpco U(r)*% sUo(r)n = 0 holds for 7 € M(Ho) C Hac(Ho) ; 
it follows that e 

[om|? = — Jim f w(x) dr. 


It remains to check that w” is integrable and limst 00. {o° wht (Far = Or liis 


sufficient to show that the continuous function uv” : IR+ — IR+ defined by 

ul? (r) = |(2¢,sVo(r)n, C(t)Uo(T)n)| = |(U (t)Q%, Uo(7)n, CUo(t+7)n)|, 
because it satisfies the inequality |w{”(r)| < 2(u%(r) + u(r). To this end, we 
write the operator C’ in the canonical form (3.10), C = 1; u(j) (Y;,-)¢;, where 


{w;}, {¢;} are orthonormal bases in H and 3; u(j) = Tr|C| < oo. Using the 
monotone—convergence theorem together with the Hélder inequality, we find 


[ & oa < Dw) 


se (f° ibs, Vott-+7)n) Par [We ,vor)n.¢)Pdr) 


Since n € M(Ho), both integrals can be estimated by Lemma 7b, 


F ices, Volt+r)nyPar = (yj, Uol2)n)Pdz < 2l fall 


[SU %.o(r)n,4)Pdr < Alfa lool (O)*4s|? < Srl fllooll BI? 


the last inequality follows from ||Q%F || = ||Q,«l] < ]Q(4)|] + ]Q(s)|] < 2|| Bl]. Denot- 
ing 7;(t) = (I? |: Up(z)n)|2dz * we obtain in this way the inequality 


0< [PP r)dr < ABI nFalle)"? DO w(t) < SalLBl fall TCI. 
J 


The functions +;(-) are uniformly bounded, so the series )7; u(j)7,;(t) converges 
uniformly with respect to t. On the other hand, lim:.7,;(t) = 0 holds by 


Lemma 7b; then the last estimate gives the relation lim:oo. f° uf (7 )dr="0, 
and therefore also lim, too fo” uy? (r)dr=0. Of 

Choosing B = I in the just proved theorem, in combination with Proposition 3, 
we obtain the following result. 
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15.2.8 Corollary (Kato-Rosenblum theorem): Let H = Ho +V, where Ho is 
self-adjoint and V is a Hermitean trace-class operator; then the wave operators 
0.(H, Ho) exist and are complete. 


A simple application is given in Problem 9a. However, Theorem 6 allows us to 
derive other sufficient conditions covering much wider classes of interactions. 


15.2.9 Theorem (Birman—Kuroda): Suppose that H, Ho are self-adjoint and 
(H—z)-!—(Ho— z)~! € A(H) for some z € p(H)Mp(Ho) ; then the wave operators 
04.(H, Ho) exist and are complete. 

Proof: We set. B := (H—z)~'(Ho— z)~! and C := (Ho— z)7!— (H—-z)~' ; then 


(Hd, By) atl (¢, BHow) = ((H—z)¢, By) _ (9, B(Ho- z)p) = (¢, Cw) 


holds for any ¢ € D(H), w € D(Ho). Theorem 6 then implies existence of the limits 
limy_.too U(t)*(H — z)~!(Ho — z)~'Uo(t)Pac(Ho)n for any n € H. Choosing 7 := 
(Ho—z)w we see that limy_.4.. U(t)*(H—z)~'Uo(t) Pac(Ho)y exist for all yy € D( Ho). 
However, this domain is dense and ||U(t)*(H — z)~!Uo(t)Pac(Ho)|| < ||(H —z)7"|| 
for all t € IR, so repeating the standard density trick once more, we establish the 
existence of 

slim U(t)*(H — 2)""Uo(t) Pac( Ho) 


Next we use Problem la; since operator C is compact by assumption, we obtain 
s-lim¢. 400 U(t)*CUpo(t) Pac( Ho) = 0, which means in combination with the above 
result that the limits limys.. U(t)*(Ho— z)~'Uo(t) Pac(Ho)n exist for any n € H. 
Choosing 7 = (Ho— z)y, we conclude that 


jlign, U(t)"Uo(t) Pac(Ha)y 


exists for all y% € D(H), and by the density argument, 04(H, Ho) exist. Fi- 
nally, the roles of H and Hp in the above argument may be interchanged; hence 
completeness follows from Proposition 3. 


To derive one more consequence of Theorem 6 we introduce the following relation 
between a pair of self-adjoint operators H, Ho : we say that H is dominated by 
Ho if there are functions f, fo : IR — [1,00) such that limjyioo f(x) = 00, 
D(f(H)) > D(fo(Ho)) and the operator f(H)fo(Ho)~! is bounded. If Ho is at 
the same time dominated by H , the operators are called mutually dominated; this 
is true, e.g., if D(H) = D(H) (Problem 10). 


15.2.10 Theorem (Birman): Let the self-adjoint operators H, Ho satisfy the fol- 
lowing conditions: 


(i) Ex(J)(H-—Ho)Ex.(J) belongs to the trace class for any bounded interval J, 


(ii) H and Ho are mutually dominated. 
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Then the wave operators 4(H, Ho) exist and are complete. 
Proof: To any A € IR*+ we define By := Exy(Jy)Ex,(Jx), where Jy := (-A,A). 
The identity 

(H¢, Br) — (6, BxHod) = (¢, Fx (Jy)(H — Ho) Eno(Jx)¥) 


holds for any ¢ € D(H) and w € D(H), so assumption (i) together with The- 
orem 6 ensures existence of the generalized wave operators 04.(H, Ho; By). The 
spectral-measure properties imply for a fixed Ay € IR and w € Ran Ey,(Ao) that 
the limits 

slim, U(0)* En (Jn)Uo(t) Pae(Ho)y 


exist provided A > Ao. We shall prove that 
Jim sup ||(7—Ex(Jy))U0(t) Pac(Ho)vll = 0 
oo tER 


holds at the same time for ~ € Ran Ey,(J),). In view of (ii), H is dominated by 
Ho ; hence there are functions f, fo with the above stated properties. We denote 
F(A) := inf { f(€) : |€| > A} ; by definition we have lim,.. F(A) = 00 and 


I| (1— En (Ja)) Uo(t) Pac Ho)? || 
< || (—En(Ja)) f(A)" Wl FCA) fo( Ho)" || fo(Ho)Uo(t) Pac( Ho) Exo (Jno) II 


< F(A) | f(A) fo( Ho)" i fo( Ho) Uo(t) Etto( Joo) oll 


for any \ > Ao. The first of the norms on the right side is finite due to (ii), and since 
J,, is a bounded interval, the same follows for the second from functional—calculus 
rules; this yields the above limiting relation. Now we have 


|| (U(t)*Uo(t) — U(s)"Uo(s)) Pac( Ho)? || 
< || (U(t)*En(Jy)Uo(t) — U(s)* En (Jy)U0(s)) Pac( Ho) || 


+ || I—En(Jy)) Volt) Pacl Ho)? | + || (— En(Ja)) Vols) Pac( Ho) Il - 


Given € > 0 we can find a A > o such that each of the last two terms is < §. 
For this \, there is a t) such that the first term is also < § for s,t > to ; together 
we have shown that the limits 


slim, U(t)"Uo(t) Pae( Ho) 
exist for 7 € Ran Ey,(J,,). Since Apo is an arbitrary number, the set of these vectors 


is dense in H and 01(H, Ho) exist. Finally, conditions (i),(ii) are symmetric with 
respect to operators H, Ho , so completeness follows again from Proposition 3. 
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In conclusion, we shall mention one result which allows us to extend substan- 
tially the domain of applicability of the above derived existence and completeness 
conditions; references to the proof are given in the notes. 


15.2.11 Theorem (invariance principle): Let the function py: J — R on an 
open interval J C R be strictly monotonic and piecewise differentiable with y’ 
absolutely continuous. Assume that Ao, A; are self-adjoint, o(A;) C J ; and at 
each of the endpoints of J either y has a finite limit or the point is not an eigenvalue 
of any of the operators A;. Moreover, let one of the following conditions be valid: 


(i) A, — Ap € A(H), 
(ii) (Ay — z)~!— (Ap — z)7! € Z(H) for some z € p(Ao) Np(A1). 
Then the wave operators 04(~(A1), ~(Ao)) exist, are complete, and 


4(A,,Ao) .-- Y increasing 
M(p(A1), (Ao) = eek ae ... y decreasing 


15.2.12 Corollary: Let self-adjoint H, Ho satisfy one of the conditions: 


(a) both the operators H, Ho are positive, and either H?— H? € J,(H) or 
(H?+ a?)-!— (H2 +.a7)-! € Ai(H) for some a >0, 


(b) e~84— e~8Ho € J,(H) for some positive 6. 


Then the wave operators 04(H, Ho) exist and are complete. 

Proof: In the first case, it is sufficient to choose J := JR with y(z) := /x for r>0 
and y(x) := x for x <0. If assumption (b) is valid, we use y(z) := —— logz on 
J t{0,00). F 


15.3 Potential scattering 


Now we want to show what follows from the existence results of the preceding 
section in the case when H is a Schrédinger operator with a potential V and Ho 
is the corresponding free Hamiltonian. Let us first consider two-particle scattering, 
where after separating the center-of-mass motion we have to compare Schrédinger 
operators H, Hg on L?(JR°). 


15.3.1 Theorem (Hack-Cook): Let H = Ho + V(Q), where V € (L? + L*)(UR°) 
for some s € [2,3) ; then the wave operators exist. 


Proof: We shall use the Cook criterion. Operator H is self-adjoint due to Prob- 
lem 11; we choose for D the subspace in L?(JR°) spanned by the vectors 


tq : Pelz) = 29/4 eT e-al/2 


ae 
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with q € JR® ; then we obviously have D C S(JR?) C D(Ho) and it is necessary-to 
check that D is dense in H,,(Ho) = L?(JR°). However, the 7, can be expressed 
as tensor products of the vectors y,, from Example 8.1.7 with p=0 and qgé€ R, 
which are dense in L*(JR) in view of Theorem 2.2.7 (because an entire function 
vanishes everywhere if it vanishes on IR), so the required property follows from 
Proposition 2.4.4a. Since D(H) = D(Hpo) follows from Problem 11, the condition 
Uo({t)D C D(H) is fulfilled automatically and assumption (ii) of Theorem 15.2.4 
holds due to Problem 7; it remains for us to check assumption (iii). The free 
propagator acts on vectors of the subspace D in the way given by Theorem 9.3.1, 


(Uo(t)vq)(z) = 1279/4 a(t)3/4 e~ OO +) )lz—al?/2 


where a(t) := (1+4t*)"' and b(t) := —2ta(t)+ 3 arg (3 + it) (Problem 12a). 

Next we define 7% := (I + |Q|)®Up(t)W, for any positive @ and t € IR, where 
1/2 

ii ( a Q?) ‘ . This function satisfies the inequality 


yn 


sup |o{"(x)| < (n(1 + 4t?) max (1 + |g| + y)8 e72?/20+4) 
zéR3 y>0 


and the maximum of the right side is not difficult to find; using the inequality 
Vl+z<1+ Vz, we get the estimate 


Saja B 
sup |v? (@)| = (m(1 +41") (1 + (ql + Jaa 4#?)) 
so there is a positive K, such that 


I(T + 1Q1)?Uo(t)dalloo < Kg (1+ |tl)~97*? 


for all t € JR ; this allows us to estimate the expression, which appears in assumption 
(iii), for H—Ho = V(Q) as follows: 


IV(Q)Uo(t) ball < VCQVL + |QI)-FIl Ky (1 + [e182 44 
(Problem 12b). Due to the assumption, V = V2+ V, with V, € L”, so we obtain 
IV (Q)Wo(t)ball < {Vall + IV(Q)VT + QF} Ky. + [22 *4, 


because ||(J+|Q])~?||.o < 1. The first norm in the curly brackets is finite, and the 
second can be estimated by the Holder inequality, 


ao 0 _ 26a i 
VQ + 1QNAIP < Wal (An [ery rtd) 
the right side is finite provided 6 > 3e=2) . The integral of assumption (iii) exists if 


@ <4 and we readily verify that for any s € [2,3) there is a @ which obeys these 
restrictions. 
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The existence of wave operators in some cases provides us with information 
about the absolutely continuous spectrum of the full Hamiltonian. 


15.3.2 Example: Due to Problem 3a, the operators Ho} Hac(Ho) and H } Ran Qs 
are unitarily equivalent; combining this result with Proposition 4.2.6, we obtain 


Cal Hol OGun(H) Cogs). 


Consider, e.g., Schrodinger operator H = Hy + V(Q) on L?(/R*) with a potential 
V , which satisfies the following conditions: 


(i) Ve Li(R*), 
(ii) there are positive C, R, € such that |V(z)| < Cr-)-* if r:=|z|>R. 


Such a potential belongs to L? + L* with s € (3(1+.€)7!,3) , so the wave operators 
exist. At the same time, V € L* + L&, so o¢,,(H) = IR* by Theorem 14.4.1. 
Finally, we know from Example 7.5.8 that oo-(Ho) = IR* . Hence we get oa-(H) = 
IRt+ , i.e., Proposition 14.4.6. 


15.3.3 Remark: If the wave operators are complete, then we have the identity 
Oac(Ho) = Oac(H) instead of the inclusion, and we need not know the essential 
spectrum. The trouble is that it is usually easier to find o-;,(H) than to check the 
completeness of wave operators. 


One of the ways to prove the completeness is based on the Kato-Birman theory 
discussed in the previous section. Let us mention a typical result. 


15.3.4 Theorem: Let H = Ho+V(Q) be a Schrédinger operator on L?(JR*) with 
V € (L'N L?)(UR3) ; then the wave operators 04(H, Ho) exist and are complete. 
Proof: In view of Theorem 14.1.2, the domains of the two operators coincide, 
D(H) = D(H), so they are mutually dominated and the Birman theorem may 
be used. It is sufficient to check that Ey(J)|V(Q)|!/? and |V(Q)|!/?Ex,(J) are 
Hilbert-Schmidt for any bounded J C R, since then their product multiplied by 
sgn V(Q) € B(H) will belong to 7,(H). As for the second operator, this follows 
from Proposition 15.1.3 because |V(Q)|!/? € L? due to the assumption. In the 
first case it is enough to check that (H—2)-'|V(Q)|'/ is Hilbert-Schmidt, because 
Ex(J)(H— 2) is bounded. The second resolvent formula gives 


(H — )"1V(Q)|!? = [1+ (Ho - d)V(Q)]* (Ho — A 'IV(Q)E”? 


provided the inverse operator exists. Proposition 15.1.3 and Problem 2 yield for 
A <0 the inequality 


|(- ava s Share; 


hence by choosing a sufficiently large negative a the norm can be made smaller 
than one, in which case the operator C := I + (Hy—A)7!V(Q) is invertible. By 
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another application of Proposition 15.1.3, (Ho—A)7|V(Q)|!/? is Hilbert-Schmidt 
and its product with the bounded operator C~! is also Hilbert-Schmidt. 


Compared with Theorem 1, the just proved result requires a much faster decay 
of the potential at large distances. Roughly speaking, we need |V(x)| < Cr-** 
for large values of r. This restriction can be weakened, e.g., for central potentials 
where the Birman—Kuroda theorem applies (see the notes). 


15.3.5 Theorem: Let V be a central potential on IR?, V(x) = v(r) for a 
measurable function v: IR*+ — IR, which satisfies the condition 


i rlu(r)|dr + lu(r)|dr < oo. 


Then V(Q) is Ho~bounded with zero relative bound, and the corresponding wave 
operators 24(Ho+V(Q), Ho) exist and are complete. 


Let us finally say a few words about asymptotic completeness. The most dif- 
ficult part is usually to check that the singularly continuous spectrum is void. In 
Section 14.4 we have derived one sufficient condition; in combination with Theorem 4 
it provides the following result. 


15.3.6 Theorem: Let H = Hy) + V(Q) be a Schrodinger operator on L?(JR?) 
with a potential V € (L'NL*)(JR*) which satisfies the condition of Theorem 14.4.8; 
then the wave operators 14(H, Ho) are asymptotically complete. In addition, the 
spectrum of H is purely absolutely continuous, so 024(H, Ho) are unitary. 


One can prove asymptotic completeness for much wider classes of potentials; 
however, the methods used to this aim are not simple. We limit ourselves to quoting 
without proof two important results (see the notes). A potential V : JR” — RR is 
said to satisfy the Agmon condition if there is an € > 0 such that the operator Ty 
of multiplication by f : f(x) = (1 + |z|*)!/*+*V(x) is Ho-compact. An example is 
any potential of the form V(x) := (1 + |x|?)~!/*-* f(x) with f € (L? + L&)(UR"), 
where p fulfils the conditions (14.2). 

In order to introduce the second mentioned class of potentials we employ the 
“localization” projections F onto the subspaces L?(B,) which we described loosely 
in Section 15.1; recall that in the simplest case of a single particle scattered by a 
potential the natural choice for B, is the family of concentric balls B, parametrized 
by the radius. Assume that the potential V : IR" — RR gives rise to a Ho-bounded 
operator with a relative bound < 1 , and furthermore, that V(Q)(Ho +i)7! is 
bounded and one may define 


h: h(r) = |(-F,)V(Q)(Ho+i)""] . 


It is easy to check that h: IR*t — JR* is nonincreasing with lim,_,.. h(r) = 0; if it 
belongs to L*(R+) the potential is said to satisfy the Enss condition. 


15.3.7 Theorem: Let H = Ho+V(Q) bea Schrédinger operator on L?(JR") with 
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a potential V , which satisfies one of the two above stated conditions; then the wave 
operators exist and are asymptotically complete. 


15.3.8 Remarks: (a) The classes of potentials specified by the Agmon and Enss 
conditions are similar. To illustrate this, take V(x) := (1 + |z|?)~'/*-* f(z), 
for which we find 


h(r) < (L+9?)-¥?°* || Ty(Ho + i)7"Il 5 


hence V satisfies the Enss condition if V(Q) is Ho-bounded with a relative 
bound < 1 ; recall also that a Ho-compact operator is Ho-bounded with 
zero relative bound. On the other hand, the methods by which one proves 
Theorem 7 are substantially different for the two conditions; the Enss method 
is technically simpler and admits an extension to scattering in N-particle 
systems (see the notes). 


(b) It follows from the theorem that oac(H) = Gac(Ho) = IR* ; if the potential 
satisfies the Agmon condition, we have also o.,,(H) = IR* because the oper- 
ator V(Q) is Ho-compact. This is not true for the Enss condition; however, 
one can prove that H has in this case no nonzero eigenvalues of infinite mul- 
tiplicity, and that A = 0 is the only possible accumulation point of o,(H). 
Since o,-(H) = @ and oa-(H) = IR* , we see that any negative A € o(#) is 
an isolated point of the spectrum with dim Ey({A}) < 00,80 Gess(H) C R*. 
Since the opposite implication is also valid due to Example 2, we find in case 
of the Enss condition again o.,,(H) = Rt. 


15.4 A model of two—channel scattering 


As mentioned in Section 15.1, the scope of the present book does not allow us to 
discuss the rigorous scattering theory to its full extent. To give the reader just a 
flavor of what was left out, we are going to discuss in the final section a model 
which is explicitly solvable, and at the same time, it exhibits some features of real- 
istic scattering systems, namely simultaneous occurrence of the elastic and inelastic 
scattering, and a resonance scattering in the elastic channel. 

The model will describe a pair of nonrelativistic particles, which may exist in two 
states; for definiteness we can imagine a neutron plus a nucleus having two internal 
states: the ground state and an excited state. After separating the center-of-mass 
motion, the state Hilbert space is therefore G = G, ®Go with G; := L?(IR*). Since 
the model is supposed to be Galilei-invariant, the Bargmann superselection rule 
requires the reduced masses in the two channels to be the same (see Remark 10.3.2); 
for simplicity we put fy, = p2 = ; . Our basic assumption is that the full interaction 
between the particles, including the part which is responsible for the transitions 
between the two channels, has a contact nature, i.e., it is supported by the origin of 
coordinates in the relative-coordinate space only. 
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Such an interaction can be constructed by the procedure described in Section 
14.6; we start from the Hamiltonian (or its e.s.a. restriction) which describes the 
free motion in the two channels, and restrict it to the set of functions whose supports 
are separated from the origin. In this way we get a non-selfadjoint operator Ag := 
Ao, ® Ao2 with 


Ao, 


—A, D(Ao,1) = Ce? (R \ {0}) , 


Ao2 = —A+E, D(Ao2) = CyUR* \ {0}), 


where £ is a positive number representing the threshold energy of the inelastic 
channel. To specify the dynamics, we have to say what happens when the particles 
“hit each other”; this will again be achieved by choosing the Hamiltonian among 
self-adjoint extensions of Ap. 

These extensions can easily be constructed since the deficiency indices of Ag 
are (2,2). This is seen from the partial-wave decomposition in the same way as 
in Theorem 14.6.4; due to the presence of the centrifugal barrier, the component 
of Ag; in the @th partial wave is e.s.a. if € > 1, so only the s-wave parts can 
be coupled. In this way the problem is substantially simplified: using the unitary 
operator (11.16) we pass to the reduced radial wave functions f : f(r) := ry(r) ; 
then we may consider H = H, ® H2 with H,; := L?(Rt) as the state space of 
the problem, and the “starting operator” will be Ho := Ho, ® Ho,2, where 


a 
Ao. = a D(Ho,1) = Co°(0, 00) , 
(15.10) 
d = 
Ao = are) +E, D(Ho2) = Cy°(0,00). 


The operator Ho has the deficiency indices (2,2), and therefore a four-parameter 
family of self-adjoint extensions. The most convenient way to characterize them is 
again through suitable boundary conditions. 


15.4.1 Proposition: For any u := {a,b,c} with a,b € IR and ce € denote by 
H,, the operator defined by the same differential expression (15.10) as Ho , with the 
domain D(H,,) C D(H) specified by the boundary conditions 


f,(0) = af,(0) + cfo(0), —f2(0) = efi(0) + bf2(0). (15.11) 


Then H,, is a self-adjoint extension of Ho. 


Proof: Using the results of Section 4.8, we obtain D(Hj) = AC?(R*) @ AC?(R*), 
and the Lagrange formula of Problem 4.53 yields 


(H3f,9) = £1(0)91(0) + f2(0)92(0) — fr(0)9,(0) — fo(0)92(0) + (f, Hog) 


for any f,g € D(H5) ; it is simple to check that the boundary term vanishes under 
the conditions (15.11). 
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In addition to the extensions specified here there are some exceptional ones (for 
instance, those obtained formally by inserting a = oo or b =o into the boundary 
conditions). These can be described by the standard methods discussed in Chapter 4 
(cf. Problem 13a), but for the purposes of our model it is sufficient to know the 
“most part” of the admissible Hamiltonians. 

It is obvious that c is the complex coupling constant between the channels; 
if c = 0 we have H, = Ha © Hy, where the last named operators correspond 
to the s-wave parts of the point-interaction Hamiltonians H, and Hgo in the 
two channels (cf. Theorem 14.6.4) with the interaction strengths a := a/4x and 
3 := 6/4, respectively. Hence the spectrum of H,, in the case of decoupled channels 
is easily obtained from the spectra of Ha and Hg ; below we shall discuss o(H,) 
in the interacting case. 


15.4.2 Remark: The above specified family of Hamiltonians is restricted further 
if we demand the interaction to be time—reversal invariant — see Problem 13b. In 
fact, we can easily check that H, and Hy with a’ =a, b’=b and c = ce™ are 
unitarily equivalent by the operator U : U(f, ® fo) =e/*f, @e7 fa. 


To find the spectrum of H,, we first have to determine its resolvent. The form of 


the Hamiltonian allows us to do it explicitly. For simplicity, we shall write elements 


of H as columns, f = ( # ) 


15.4.3 Proposition: For any z € p(H,,), we denote k := ,/z. The resolvent 
(H.—z)~' is an integral operator on H with the kernel 


etkir-+e’|_pikir—r’| 


f, = 2k 0 
G(r, 1°; k) = ( A elsirtr’|_pinirar’| 


0 2t« 


4p" (b—ix) ek(r+1’) — celikrtnr’) 
Eetlartkr’) (a— ik) eis(r+r’) 


where D := (a—ik)(b—ix) — |c|? and « := Jk?—E. 
Proof: If up := {a,b,0} the operator H,, decouples into an orthogonal sum, so 


its resolvent kernel expresses through the known resolvent kernels of the point- 
interaction Hamiltonians H, and Hy, as 


" = Galt, 7’; k) 0 
ee ( 0 Fe) 


where 
: ; . 
7 etkirtr’| _ etklr—r’ | etkr eikr’ 
ga(r,7 5k) = ——_——_—— 
2ik a—tk 


and gp is obtained by replacing a,k with b,«, respectively. Since H,, and H,, 
are self-adjoint extensions of the same symmetric operator Ho with finite deficiency 
indices, the resolvent of Hy can be expressed by means of Theorem 4.7.15: its kernel 
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equals 
2 
Gulr,r';k) = Gulrr3k) + Do Aj(k?) FF) Ff (r') 
jl=l 


where t means transposition and the 2 x 1 columns Fe can be chosen as 


ios ga(r, 0; k) Xe 0 ‘ 
nvm (28%), = (eh) 


this ensures that the F;* solve the equation (Hj —k”)F* = 0, i.e., they belong to 
the deficiency subspaces of operator Ho. It remains to find the functions ,:(-). We 
‘use the fact that the resolvent (H,—z)~! maps H into D(H,), which means that 
for any f € H, the component functions of (H,—z)~!f must satisfy the boundary 
conditions (15.11). This requirement yields a system of four linear equations, 


Au (k? 
Mit) = Ent), an) =e(14 20), 
A20(k?) c 
- aye 22 a 2) _ 2 
Aio(k ) e(1 + b—ik ; A2o(k ) anak ak i; 


which is readily solved (Problem 13c); substituting into the above expression of 
Guo(r,1’; k) , we arrive at the result. & 


The explicit form of the resolvent makes it possible to study its singularities. In 
fact, it is sufficient to find zeros of the “discriminant” D appearing in the resolvent 
kernel. To see this, notice that for z € € \ IR*, the resolvent can be written as 
(H,,—z)~! = A(z) + D7!(z)B(z), where the two operators corresponding to the 
matrices in Proposition 3 are bounded, and therefore they cannot give rise to a 
pole. On the other hand, we can check directly from Proposition 1 that H,, has no 
embedded eigenvalue unless c = 0. Of course, the resolvent also has cuts associated 
with the continuous spectrum, which come from the square roots in definitions of 
k and x, but they are independent of the interaction. 

Consider first the decoupled case, c = 0. The discriminant then factorizes, so 
D =0 holds iff k = —ia or k = —tb. By Problem 4.24b, the spectrum is the union 
of spectra of the two point—interaction operators, the second of which is shifted by 
the constant E'. We have to distinguish several cases: 


a<0 ... 4H, has the eigenvalue —a? corresponding to the normalized eigen- 
function f(r) = /—2a ( - ); 
a>0Q ... the pole now corresponds to a “zero-energy resonance” (a = 0) or 


to an “antibound state” (7.e., a resonance on the negative imaginary axis of 
k — deeply “hidden” on the second sheet of energy), 


b<0 ... Ay, has the eigenvalue F'—b? corresponding to V—26 ( me i 
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BS Oh 2% H,, has a zero-energy resonance or an antibound state. 


The continuous spectrum of the decoupled operator is again the union of the con- 
tinuous spectra of H, and Hb, i.e., it covers the interval [0,00) being simple in 
(0, £) and of multiplicity two in [E,oo) (cf. Problem 13d; we say that the spec- 
trum of a self-adjoint A has multiplicity n in an interval J if E4(J)A is unitarily 
equivalent to @f_y AY , where each Ae has a simple spectrum). Our main in- 
terest concerns the cases when both a,b are negative; among these, we shall pay 
particular attention to the situation when 6? < E, i.e., the eigenvalue of Hy is 
embedded into the continuous spectrum of H,. 

After this preliminary, let us turn to the interacting case, c # 0. Since the 
deficiency indices of Ho have been finite, the essential spectrum is not affected by 
the interaction due to Theorem 4.7.14. To find the eigenvalues (or resonances) of 
H,,, we have to solve the equation 


(a—ik) (b-iVk?—E) = |ci?. (15.12) 


It reduces to a quartic equation, and can therefore be solved in terms of radicals; 
however, the formulas expressing the roots for general a,b, and c are too compli- 
cated to be of practical use. We shall therefore concentrate our attention on the 
weak-coupling case in which the equation (15.12) can be solved perturbatively; at 
the same time it represents one of the physically most interesting situations. 


15.4.4 Theorem: (a) Let the point spectrum of the decoupled operator H,, be 
nondegenerate, —a? # E—b?; then the perturbed first-channel eigenvalue 
(resonance) behaves for small |c| as 


9 2a|c|* a* — F—bvVa?+E ' ‘ 
+ Va?+ Va? +E (b+ va?+ E) 
In particular, the zero-energy resonance corresponding to a = 0 turns into 


an antibound state if H,, has an isolated eigenvalue in the second channel, 
b < —VE, and into a bound state otherwise. 


e,(c) = —a 


(b) Under the same assumption, let H,,, have an isolated eigenvalue in the second 
channel, b < —VE; then the perturbation changes it into 
2b|c|? b+ E-—aV/e—E 
este) Som oF 4 ty Vt BE a a cgay 


a+VP-E JE (a+ VE) 


On the other hand, if H,, has an embedded eigenvalue, men vere Ov, 
then the latter under the perturbation gives rise to a pole of the analytically 
continued resolvent with 


2ab|c|? 


— 2 
Reex(c) = E — 0° + P+ EB + O(\cl*) , 
2bic|? VE— 
Imeo(c) = i a + O(|c|*). 


a?—b?+ EF 
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(c) Finally, suppose that H,,, has a nonsimple eigenvalue, b = —/a?+ E. Under 
the perturbation, it splits into 


ae aie 2 


ey? Se i oma a A 
€12(C) a® = 2V—-ava?+ Ec] + Qala2-+ B)572 


lel? + O(|c[*). 
Proof: We rewrite the spectral condition (15.12) as (a — iz) (2 — iV 27— j= rl? 


where a :=a/VE and (,7,z are similarly renormalized a,b,k. To prove part (a), 
we put z = ~—ia+ © c,|7\". Substituting this into the above condition and 
expanding the left side in powers of ||, we get an infinite system of equations, 


—ic(8+A)—-1 = 0, 
—ico(84+ A) + acjA = 0, 
~ic3(B + A) + 2acyc,A7! + saa = 
etc., where A := Va?+1, which can be solved recursively (Problem 13e). Retur- 
ning to the original quantities, we have 
emildin.. ia|c|* 
b+Ve+E Sate (b+ Va+E) 
bE + (E—4a*)Va?+ E 


ete US pei a gE AY oO 8 
" (a?-+ E)3/2 (b+ Vae+ EB)’ Ne) 


so taking the square we arrive at the expression for e,(c). Notice that k, is purely 
imaginary; if a = 0 its sign for small |c| is given by the denominator of the second 
term on the right side. Assertion (b) is proved in a similar manner; we obtain 


ky = —1a + 


: ib\c}? ; E+ (267+ E)J/t?—E 
ly =<iVPCE ible| ee 18344 0)(\at"), 


VRE (a+ VP=E) — 2(?—-B)9/ (a+ VP—E) 
which leads to the expression for the eigenvalue e2(c) , and 
b|c|? i(2b?+ E)/E-—b? i 
a sae pe ol*+ O(lel*), 
VE-# (a —iVE—6) 2(E—b2)3/2 (a—iVB-P ) 


respectively. In the last named case kz is no longer purely imaginary and its square 
equals 


kg = VE—b? + 


2 “(42 _ o./RoE 
e(c) = EB — 0? + ——— fh, Pr a Sl Iclt + O(|cl®) ; 
a-iVE-  JE-# (a-ivE-#) 


taking the real and imaginary parts we obtain the remaining relations of part (b). 
As in Section 9.6, the resolvent of H, is an analytic function out of the real axis. 
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The corresponding pole is not on the first sheet but can be reached by analytic 
continuation; note that the imaginary part of the pole position is negative. 

Finally, in case (c) the above perturbation expansions lose meaning; we replace 
them by z= —ia + i D., byly|* . Substitution into the spectral condition (15.12) 
yields a system of equations which can again be solved recursively; however, for the 
odd coefficients we now get two solutions differing in sign, 


Var+E iE? |c|? a /—aE(5E-8a’) 


: ‘ 3 4 
hig —to ki ae Cli ger reeege i ata? Ey | + Oe). 


For both signs and |c| small enough, this value is negative imaginary; the proof is 
concluded by taking its square. {J 


15.4.5 Remarks: (a) It follows from the first two assertions that the interaction 
always repels the eigenvalues; in the case (c) it removes the degeneracy. The 
appearance of the powers of |c| in the last expansion is related to the fact that 
the original eigenvalue had multiplicity two; we have obtained an example of 
the so-called Puiseaur series — see the notes. 


(b) We have mentioned that the continuous spectrum of H, is the same as in the 
decoupled case; we can check, moreover, that it is also absolutely continuous 
(Problem 14a). 


After this preliminary, let us now consider the model in question as a scattering 
problem. By Proposition 3, the interaction represents a finite-rank perturbation 
to the free resolvent; hence the existence and completeness of the wave operators 
follow from the Birman-Kuroda theorem. Furthermore, Remark 5b shows that 
Q+(Hu, Hu) are also asymptotically complete. More interesting, however, is the 
explicit form of the S-matrix. 

We can find it easily using a procedure, which is common in the stationary 
scattering theory, and which we used already in Example 7.2.2, namely by look- 
ing for generalized eigenfunctions of the corresponding Hamiltonian with a suitable 
asymptotic behavior. In the present case we take the function 


sie ee 


which is locally square integrable, and require it to belong — again locally — to the 
domain of H,,, in other words, to obey the boundary conditions (15.11). In this 
way, we get a system of linear equations which is readily solved by 
a+ik)(b—ik) — |c|? 
A om Sah) v= (tHE mi) — la 
where D is the same as in Proposition 3, and 


_ Bike 


B: 
D 
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Consider first the case when k? < E. A simple calculation (Problem 14b) shows 
then that |A| = 1, and since the reflection amplitude A is (with the choice of sign 
that we have made) just the s-wave part of the first-channel on-shell scattering 
matrix (see the notes), we have checked that the scattering is elastic for k? < E. 
In other words, there is no solution with a “plane-wave” component in the excited 
channel, because the incoming particle does not have enough energy to excite the 
target permanently; we usually say that the inelastic channel is closed in this case. 

Furthermore, recall that the on-shell S—matrix (in a given partial wave, in our 
case £=()) is expressed conventionally through the phase shift as So(k) = eo“) . 
Another simple calculation then yields 

k(b—ik) 
a(b—ik) — |c|? 
The most interesting situation occurs when H,,, has an embedded eigenvalue, which 


turns under the perturbation into a resonant state whose lifetime, according to 
Example 9.6.6 and Theorem 4b, is expected to be 


6o(k) = arctan (mod 7r) . 


—V+E 
10) = — Fa pep (1+ Olle?) 


we shall return to this problem a little later. At the same time, the phase shift 
should exhibit a sheer change peculiar to a resonance. Since « is purely imaginary 
in this case, we may rewrite the expression of the phase shift as 


Pr Pere? at 
a(b + VE—k?) — |c|? 

If there is a bound state in the first channel, a < 0, we have do(k) < 0 in the 
interval (0, /£—b6?) with zero value at its endpoints. Furthermore, 69(k) reaches 
the value $= at ko := \/E — (|c|?a~!— 6)? provided |c|? < ab ; in the opposite case 
the argument of the arctan in the above expression has no singularity at all in 
(0, VE). For a,b fixed and |c| small enough, the phase-shift plot is composed of 
the background arctan(#) and the jump to the next branch of this function around 
the singularity. The energies at which 69(k) = +7, e.g., are 


2b|c|? 5 
re atopy Bt VE b?) + O(|cl*) , 


i.e., Reeo(c) + Ime2(c). However, it has to be pointed out that the jump may be 
substantially deformed, even for small |c|, if |a| is small enough; to see this it is 
sufficient to take an arbitrarily but fixed |c| and a := |c|?/b. 

We can also observe the resonance behavior in the cross Baevion: Since oe 
scattering is obviously isotropic, we have otor(k?) = oeao(k?) = 44 sin? 6p(k). 
the embedded-eigenvalue case therefore the above expression of the on shift , 
for k? < E the following relation, 


a= E-B+ 


4n (b+ VE—F)" 


tot kK? ee a> a a. 
elt [a (6+ VE—F*) — |cl?]’ +4? (b+ VE—) 
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For a,b fixed and |c| small enough, the cross section varies rapidly (has a dip 
followed by a peak) in the region of the width ~ T(c)~! around the value k = ko. 
Let us now turn to the case when k? > E, i.e., the incident particle energy is 
greater than the excitation energy of the target. The reflection and transmission 
amplitudes then satisfy 
|Al? + 7 BP = 1. 


This relation illustrates that the requirement of Hamiltonian self-adjointness (which 
is contained in the boundary conditions determining the amplitudes A, B ) means 
physically the conservation of probability flow (cf. Problem 14.24d): it is easy to 
see that its radial components J; := 2Im f;f} are 


J, = -2k(1-|Al*) , Jo = 2n|BP 


in the first and the second channel, respectively. The elastic scattering is now 
nonunitary since B # 0 in the interacting case, and therefore |A|? < 1 ; this is due 
to the fact that the “nucleus” may now leave the interaction region in the excited 
state. The above relation can also be expressed as 


|So,11(k)|? + |So1+2(k)|? = 1, 


i.e., as a part of the full two-channel S-matrix unitarity condition (Problem 14b). 

Let us return to the resonance-scattering situation. We have already mentioned 
the generally accepted concept, according to which scattering resonances are quasi— 
stable objects of a sort. To illustrate this in the present example, we are going in 
the rest of this section to examine in more detail what the resonant state arising 
from the embedded eigenvalue of H,,, looks like and how it decays. The most 
natural choice for the “compound nucleus” wave function is the eigenstate of the 
unperturbed Hamiltonian (see the notes), 


f: f(r) = V-2 ( oJ (15.13) 


recall that —/E <b <0. To find the decay law of this state, we have to compute 
according to Section 9.6 the reduced resolvent (f,(H,—k?)~'f) and investigate its 
analytic structure. Using Proposition 3, we obtain after a simple integration 


~~ [cl]? + (a—ik)(b+ik) 
ERD = Prin Fle ~ (a= iR)(0 AR) 


(Problem 15a). Notice first that the function on the right side corresponds to a four- 
sheeted Riemann surface with respect to the complex parameter z = k? (one usually 
speaks about the complex energy plane, or briefly, z—plane): it has cuts along the 
intervals [0,00) and [£,oo) on the positive real axis corresponding to the square 
roots in the definitions of k and x. It is customary in stationary scattering theory 
to pass to the variable k (we speak about the momentum plane, or k-plane); 
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then the above expression corresponds to a two-sheeted surface with the cuts along 
(—c0o, -VE] and [VE, ov). 

The reduced evolution operator of the resonant state, which acts in the present 
case as multiplication by v,(t) := (f,e7*#*‘f) on the one-dimensional subspace of 
H. spanned by the vector f can be calculated as in Section 9.6. Using the fact that 
the reduced resolvent is (in the z—-plane) analytic in € \ IRt+ , we may replace the 
integral appearing in the Stone formula by integration over a curve encircling the 
spectrum of H,, in a clockwise way. In the k—plane this gives 


walt) = = fe", (Hak) dk 


where I’ is a curve in the upper halfplane (corresponding to the “physical sheet” 
in the z—plane) which runs from —oo + ie to 00 + i€ passing above the set {k : 
k? € o(H.)} (draw a picture). Notice that we have to consider only ¢ > 0 since 
vy(—t) = vu(t). 

Evaluating the integral (Problem 15a) we have to distinguish two cases. If there 
is an antibound state in the first channel, the corresponding pole is not on the first 
sheet and the line {k. = k+ie : k € R, € > 0} can be used as I’. On the other 
hand, if the pole is placed on the first sheet, then moving I towards the real axis 
in the k-plane, we get in addition the integral over a clockwise circle around the 
pole. The last named contribution to the integral can be calculated by the residue 
theorem; we get 


Ic|? + (a—ik,)(b+iK1) etki 
(b+ixy)?||cl? — (a—ik)(b—ix)| 


| eile) \cl? + (a—ik+e)(b+ike) (k-+ie) dk 
R (b+ine)?||c|? — (a—ik+e)(b—ixe)] 


Vy(t) = — jim (k—ky) 2k; 


Ty 60+ 


where «, corresponds to the first-channel bound-state pole position k; (cf. the 


proof of Theorem 4a) and «, := ,/(k+te)? — E. If a > 0 we get the same expression 
without the first term; in the case of zero-energy resonance, which we shall not 
discuss in detail, the first term enters with the factor 3 ' 

By a straightforward computation using the identity |c|? + (a—iki)(b+iK,) = 


2b(a — ik) , we find that for a < 0 the first term equals 
4ab|c|* 

(a?—b?+ EF)? 

On the other hand, the limit ¢ — 0+ in the second term cannot be interchanged 


with the integral directly because the limiting function is not integrable. Fortunately, 
we can rewrite the integral using a simple substitution as 


eit (1 4+ O(|cl?) . 


re) 2 a . . 
if enikt \cl? + (a—tke)(b+ike) — 
0 (b+ine)?||c|? — (a—ike)(b—ike)| 
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36 “ 2 OTe ee a 
ne i en ike __ lc" lee fl ap 
0 (b+i%.)?{|cl? — (atike)(b—ike)| 


where k, := k+ie and 

Ke «1. OX KS VE 
= 

eee VE <k 


Here we have used the fact that the limit of « when k approaches the real axis 
from the upper halfplane (on the first sheet) equals +Vk?—E on the right and left 
cut, respectively, and iv/E—k? in between. When we now write the expression in 
question as a single integral over (0,00) , the dominated—convergence theorem may 
already be applied giving 


ar ‘ls et —4ik?b|c|? dk 
mi Jo = (b+)? |cl4 — 2alc|?(b—ik) + (a?+ k?)(b—ix)? 
ett —4ibx|c|* — 4ibk|c|?(b? + x?) 


sees kdk. 
ye ies (b?+ K?)? |c|4 — 2|c|?(ab—kn) + (a? + k?)(b? + x?) 


The first integral is for small |c| expected to be dominated by the contribution from 
the second-sheet resonance pole; recall that the “discriminant” D is contained in 
the denominator. The corresponding residue equals 


\c|? + (a—ik)(b+ix) oniwe 
(b+ix)?||c|? — (a—ik)(b-ix)] 
yf 2bk2K2(a—ik2) en tkit 


Ti (b+ik2)?|E + i(aka—bra) 


where Ko := \/E—k% and kp is given in the proof of Theorem 4b. Using its 
expansion in powers of |c|*, after a tedious but straightforward calculation we find 
that the preexponential factor equals —1/277 up to higher-order terms, i.e., that 
the resonance-pole contribution to v,(t) is et (1+ O(|c|*)] as expected. 

It remains for us to determine the background term. In the first of the above 
two integrals, we close the integration curve by extending it to a point R ; then 
continuing over a circle segment of radius R, and returning to the origin on the 
fourth-quadrant axis. After the limit R — oo, the contribution to the background 
consists of the integrals over (VE,0o) and the fourth-quadrant axis. The first of 
them, however, can be shown easily to cancel, up to higher order terms in |c|? , 
with the second integral; hence we finally obtain the following result for the reduced 
propagator and the corresponding decay law P,(t) := |vu(t)|?. 


im (k—ko)k 
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15.4.6 Theorem: Assume a 4 0 and —VE <b < 0. The reduced propagator 
and the decay law of the resonant state (15.13) corresponding to the Hamiltonian 
Hy, are given by 


-t 2(|al—a)b ya ib ya 
Jay a = ik3t — 2 ae ikst el ik3t 
vault) {« # Iq ee ot ee 
4b ; 00 22e-2"t dz : 
aM pa ni/4 pe 2 
ss T [ aE ea |} (14 O(le| )) 


and 


2A(|a|—a)b  _iya pa 
aa 2(Imeg)t 2 i(k?-k. 
Hatt) = fe 2 Qic|*Re (ete (ki -kg)t 


ab 2(Imeg)t 
+ 7eVa-WEp eee 
2 


4b i(k2t— 0° z en27t dz 
—— et(k3t—m/4) ee ee : 
mS i (z2+ ia?) (2? ean} (1+ O(\¢ )) ) 


where e; := e;(c) := k3 are specified in Theorem 4. 


Hence we have obtained the expected result, namely the exponential decay law 
with a lifetime proportional to the inverse distance of the resonance pole from the 
real axis, up to higher order terms in |c|?. 

These corrections are also interesting. First of all, the model illustrates that 
the initial decay rate can be zero even if the decaying state does not belong to the 
domain of the Hamiltonian (cf. Problems 15b and 9.18). On the other hand, the 
long-time behavior of the decay law depends substantially on the spectrum of the 
unperturbed Hamiltonian: if it has an eigenvalue in the first channel (so H, also 
has an eigenvalue for a sufficiently weak coupling), then the decay law contains a 
term of order of |c|*, which does not vanish as t > oo, and therefore dominates the 
expression for large time values; it is clear that this term comes from the component 
of the first-channel bound state contained in the resonant state (15.13). 


Notes to Chapter 15 


Section 15.1 One often speaks about particle beams of a given energy when describing 
scattering experiments. This is not precise, of course, because scattering states belong to 
the continuous spectrum of the Hamiltonian. The correct meaning is the following: using 
spectral representation for the free Hamiltonian, we describe the incoming—particle state 
by a wave packet with energy support in the vicinity of a given energy value, assuming 
that the support size is small enough in the energy scale characteristic for the processes 
under consideration. If we study resonance scattering, for instance, we have to choose the 
“beam monochromaticity” small compared to the widths of the resonances in question. 
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The primary, or direct, problem of scattering theory is to find measurable quantities 
such as cross sections, phase shifts, etc., from knowledge of the interaction between the 
projectile and the target. The inverse scattering problem poses the opposite question: 
to find the interaction, most frequently the potential in a Schrédinger operator, from 
scattering data; more information can be found, e.g., in the monograph [ChS]. 

The assumption that the projectile and target behave as free at large distances, 1.e., 
that there are asymptotic states w+ to a given scattering state ~, which we have used 
to introduce the wave operators, is not valid for long-range interactions like the Coulomb 
force. In this case the definition of the wave operators has to be modified — see, e.g., 
[ AJS], Chap.13; [RS 3], Sec.X1.9. Notice also that in physical literature for histori- 
cal reasons one mostly uses the “reversed” convention in which the wave operators 04 
correspond to the limits t -> oo. 

Proposition 3 generalizes in various ways — see [RS 3], App. to Sec.XI.3. The result 
we have derived for scattering states modifies to the case o.-(H) # 0; the set M,(H) 
has to replaced by the subspace 


: T | 
M,(H) := {wen : ee a | \Fee~4t yi? dt = 0 for all r>o} 
Tc 2T J_7 


which satisfies M,(H) = H.(H) — cf. [AG 1], and also [ AJS], Sec.7.6; [RS 3], App. 
to Sec.XI.7. Hence the pathological states mentioned in Remark 5 belong to the singularly 
continuous spectral subspace of the Hamiltonian. 

Basic information about scattering theory can be found in most quantum mechanical 
textbooks; an extensive exposition is given, e.g., in [Ta], [New]. Rigorous formulation 
of scattering theory is the content of the monographs [Am], [AJS], [BW], [Pea], [Pe], 
and [RS 3]. We should keep in mind that the terminology is sometimes not unified: the 
(asymptotic) completeness is often called (strong) asymptotic completeness, etc. Some 
authors use the term Moller operators for 04, after the author who introduced them 
formally for the first time in 1945. Finally, let us mention that Hilbert space methods are 
also used in acoustic and electromagnetic—wave scattering — see [LP]. 


Section 15.2 Proposition 3 looks like a simple tool for proving the completeness; however, 
in order to use it, one has to check the existence of 24(Ho, H), which is often difficult. The 
Cook criterion was proved in the paper [Coo 2], which in 1957-58 together with {Ja 1,2], 
[Ka 6], [LF 1] formulated the foundations of rigorous scattering theory. Theorems 6 and 
8-10 form the backbone of the so-called Kato-Birman theory, named after the authors who 
formulated it together with S. Kuroda, R. Putnam, M.G. Krein, D.B. Pearson and others. 
Theorem 6, which allows us to simplify earlier proofs considerably, comes from [{ Pea 2]. 
The Kato—Rosenblum theorem is the oldest result of the theory; it can be found in (Ka 6], 
[Ro 1]. Theorem 9 was proved in [ Kur 1,2], { Bir 2,3]; note that there are examples 
showing that J\(H) cannot be replaced in the hypothesis by J,(H) for some p > 1. 
One possible application of Theorem 9 is to prove existence and completeness of wave 
operators for point-interaction Hamiltonians with a finite-number of centers introduced 
in the same way as in Section 14.6. Theorem 10 was proved in {Bir 4]. Further references 
to Kato-Birman theory can be found in [RS 3], notes to Sec.XI.3. 

The invariance principle was discussed for the first time in { Bir 3]; later it was 
repeatedly generalized. Proof of Theorem 11 and further details can be found in [Ka], 
Sec.X.4; [RS 3], Sec.XI.3 and App.3; [ We], Thm.11.13. 


NOTES 527 


Section 15.3 Theorem 1 was proved in the paper [Coo 2] mentioned above for potentials 
from L?(JR3) and generalized in {Ha 1] to potentials V € L?,.(IR3), which behave as 
O(r-!-*) at large distances. This result was further extended in [KuS 1] with the help of 
Theorem 15.2.5 to potentials of the same decay but stronger local singularities. Theorem 4 
extends to Schrédinger operators on L?(IR") — see [RS 3], Thm.X1.30; its disadvantage is 
that the requirement V € L!ML? represents a rather strong decay restriction. Theorem 6 
follows from the Birman—Kuroda theorem combined with the partial-wave decomposition 
— ef. [RS 3], Thm.XI1.31; an alternative proof can be found in [DF 1]. 

To avoid the impression that the completeness of wave operators is a natural property, 
which is just difficult to prove, recall the example constructed in [Pea 1]. It represents 
a short-range central potential composed of rectangular barriers and wells chosen so that 
the corresponding wave operators are not complete; physically this result means that the 
scattered particle may be captured, so it is not able to leave the interaction region. 

The Agmon and Enss conditions formulate in the same way for Schrédinger operators 
on L?(R"), and both Theorem 7 and Remarks 8 extend to this case. The first part 
of Theorem 7 represents a particular case of the Agmon-Kato-Simon theorem — see 
{RS 3], Thm.XIIL.33. The remaining part is proved by the “geometric” method proposed 
by V. Enss, which is based on estimates of the asymptotic behavior of spreading wave 
packets — see [En 1,2] and also [Si 7]; [RS 3], Sec.X1.17; [Am]; [Pe], Sec.2.7; 
[CFKS], Chap.5; [En 3]. Similar methods have been used in recent proofs of asymptotic 
completeness for N-particle scattering for some classes of short-range and long-range 
interactions — see, e.g., [SiS 1,2], {Sig 1], and also [Der 1,2], [Gra 1], [Kit 1]. 


Section 15.4 The model discussed in this section was introduced in [Ex 6]. It is known 
from perturbation theory that if the unperturbed eigenvalue has a multiplicity m > 1, 
then the perturbed eigenvalue is given by the so-called Puiseaux series, t.e., rather than in 
powers of the perturbation parameter g itself it expands in powers of g!/™ — see [Ka], 
Chap.2; in the present case we have g = |c|? and m=2. 

As we have mentioned, the relation between stationary scattering theory and the con- 
cepts discussed in Sec.15.1 is given by the direct-integral decomposition S = f® S(A) dA, 
where S(A) is the on-shell scattering matrix for energy \ — see [AJS], Sec.5.7, for more 
details, and also the notes to Sec.7.2. The generalized—eigenfunction expansion can also 
be written for the model in question, but we shall not discuss this problem. 

In the decoupled case, c = 0, the expression obtained for the phase shift reduces 
to the known formula for the point-interaction phase shift — cf. [ AGHH], Sec.I.1.4. 
In the same way as the phase shift and cross section, other scattering quantities can be 
computed for the present model, e.g., the time delay caused by the scattering which is 
related to the S—matrix by the so-called Eisenbud-Wigner formula — see [ AJS], Sec.7.2; 
and also {Mar 1], [AmC 1]. The appearance of the factor «/k in the relation between the 
reflection and transmission amplitudes expresses the fact that the particle has different 
velocities in the two channels. Of course, true scattering states which are wave packets 
composed of generalized eigenfunctions do not have a sharp value of velocity, but even 
then the mean values of the velocities differ because a part of the incident kinetic energy 
has been absorbed to excite the target. 

The ways to choose the resonant-state wave function have been discussed in [Hun 1]; 
it appears that for an embedded-eigenvalue perturbation problem like that treated here, 
(15.13) is essentially the only possibility — see [Ex 6] for more details. The resonances 
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in the present model come from zeros of the function D, so they are simultaneously poles 
of the (analytically continued) reduced resolvent and scattering matrix. In general, one 
expects that the same should be true in other cases too, i.e., that different possible ways 
to define a resonance will lead to the same result; however, it is not always easy to prove 
this — see, e.g., [How 3], [BDW 1], and also {Si 4], [Hag 1] for the equivalence between 
the scattering resonances and the dilation—-analytic resonances mentioned in the notes to 
Sec.9.6. 


Problems 


1. Prove: (a) Suppose that A is self-adjoint, C’ is compact, and w € Ha-(H) ; then 
limje|00 Cc ety, =0. 
(b) The conclusion of Proposition 15.1.2 remains valid if o.-(H) = @ and the 
operators F,(H —z)~* are compact for some z € p(H), 8 >0, and all r>0. 
Hint: (a) The Riemann-Lebesgue lemma implies w-limje)—.00 etAty, — 0, 
(b) (H — z)’P, with P, := Ey(—a,a) is bounded for any a, >0. 
2. The function g: g(k) = |k? — z|-! belongs for Imz #0 to L?(iR?) and its norm 
is equal to ||g|| = z|Im/z|~!/? for Rez > 0. 
3. The wave operators 24(H, Ho) have the following properties: 
(a) the subspaces RanQy reduce the group {U(t)}, and H}RanQ4z is unitarily 
equivalent to Ho} Hac(Ho) , 
(b) RanQy C H.-(H). 


Hint: (a) Use the intertwining relations. 
4. Suppose that the wave operators Q4 exist and satisfy relation (15.3); then 


(a) the scattering operator is a partial isometry with the initial subspace 7Ha-( Ho) 
and S}#4-(Ho) is unitary iff Ran0y = RanQ_, 

(b) the operator S commutes with the free Hamiltonian, i.e., Uo(t)S = SUo(t) 
for all te RR. 

Hint: (b) Use the intertwining relations. 
5. Prove: (a) Q4 = s-lime.ioo Vo(t)*U(t) Ps, where the Ps are the projections onto 

the subspaces RanQz., 

(b) Uo(t)Q% = ON4U(t) for all t € IR, and furthermore, 04H C Hot. 


6. Prove: (a) Let w € D(Ho) and Uo(t)y € D(H) for some t € IR. Then the 
function w : ~(s) = Uj(s)U(s)w is differentiable at the point t and y(t) = 
iU (t)*(H —Ho)Uo(t)y. 

(b) Suppose that ~, ¢ : IR — H are differentiable functions; then f : f(t) = 
(u(t), (t)) is also differentiable and f’(t) = ((t), O(t)) + (Y(t), ¢(d)). 
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(c) If a function wy : IR — H is continuously differentiable on an interval {a, 5], 
then y(d) — ¥(c) = [4 W(t) dt holds for a<c<d<b. 


7. Let Ho, V be self-adjoint operators and Up(t) := e~*#o*. If V is Ho-bounded; 
then the vector-valued function VUo(-)W is continuous for any ~ € D(Ho). 


8. Let :,. be the operator defined in the proof of Theorem 15.2.6. Show that 
(a) O24 is compact, 


(b) the function T ++ Q47,547¢ is continuously differentiable for any ¢€ H and 
d 
ae OQe47,2470 = (C(t =: T) = C(s a T))¢ : 


Hint: (a) Using the continuity of C(-)¢, check that ¢n — ¢ implies Q,s¢n > D2,6¢. 
9. Consider the Friedrichs model of Section 9.6 as a scattering system. Prove that 


(a) the wave operators 04(H,, Ho) exist and are complete, 
(b) if the function v is continuously differentiable, then the 0+(H,, Ho) are also 
asymptotically complete. 


Hint: (b) In analogy with the proof of Proposition 9.6.7 express the full resolvent 
and use Theorem 14.4.7 — cf. [DE 2], Part IV. 


10. If self-adjoint H, Ho have the same domains, they are mutually dominated. 
Hint: Use the closed-graph theorem for J + |H|, J +|Hol. 


11. The Schrédinger operator Hp +V(Q) on L?UR"), n>3, with Ve I? +L? is 
self-adjoint on D(Ho) provided p satisfies condition (14.2) and s & [p,n). 
Hint: Check that L? Cc 1? + L®™. 


12. Using the notation from the proof of Theorem 15.3.1, 


(a) compute Uo(t)%, , 
(b) verify the estimate of the expression ||V(Q)Uo(t)wql| used there, 


(c) generalize the Haak-Cook theorem to Schrédinger operators H = Hp + V(Q) 
on L*(JR"), n> 3, with a potential obeying the assumptions of Problem 11. 
In the same way, extend the conclusions of Example 15.3.2. 


13. Using the notation of Section 15.4, 


(a) find all self-adjoint extensions of the operator Ho, 


(b) show that the system with the Hamiltonian H, is invariant with respect to 
the time reversal iff the coupling constant c is real, 


(c) compute the functions ;,(-) from the proof of Proposition 15.4.3, 
(d) find the multiplicity of o(H,,) corresponding to uo := {a,b,0}. 
(e) fill the details into the proof of Theorem 15.4.4. 
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Hint: (d) Cf. Example 9.1.6. 
Consider the model of Section 15.4 as a scattering system. 


(a) Prove that os(Hu) =9. 


(b) Find the on-shell scattering matrix and prove its unitarity. Derive the phase— 
shift expressions for elastic-channel scattering. 


Hint: (a) Use Proposition 15.4.3 and Theorem 14.4.7. 


Consider the operator H, discussed in Section 15.4 as the full Hamiltonian of the 
unstable system whose state space is the one-dimensional subspace in # spanned 
by the vector (15.13). 


(a) Fill the details into the sketched proof of Theorem 15.4.6. 
(b) Show that the decay law (15.14) has zero initial decay rate, Py(0+) =0. 
Hint: Im(f,(Hu—A)~'f) is O(A~*/) for large positive  ; use Problem 9.18. 


Find the on-shell scattering matrices Sy(k) for each of the operators Hy considered 
in Proposition 14.6.5 and Problem 14.24, which describe a particle on the Y-shaped 
graph. What is the limit of Sy(k) as k + 0+ for different U ? 


Consider a charged quantum particle on the graphs of Problem 14.25. 


(a) Find the reflection and transmission coefficients. 


(b) How do the results change if the graphs are placed into a homogeneous electric 
field perpendicular to the leads? 


Hint: (b) For a loop with two leads see [ESS 3]. 


Appendices 


A. Measure and integration 


We suppose the reader is familiar with the basic facts concerning set theory and integration 
as they are presented in the introductory course of analysis. In this appendix, we review 
them briefly, and add some more which we shall need in the text. Basic references for 
proofs and a detailed exposition are, e.g., [Hal 1], [Jar 1,2], [KF 1,2], [Ru 1], or any 
other textbook on analysis you might prefer. 


A.1 Sets, mappings, relations 


A set is a collection of objects called elements. The symbol card X denotes the cardi- 
nality of the set X. The subset M consisting of the elements of X which satisfy the 
conditions Pi(x),..., P(x) is usually written as M={2 EX: P,(x),...,Pa(x)}. A 
set whose elements are certain sets is called a system or family of these sets; the family of 
all subsystems of a given X is denoted as 2* . 

The operations of union, intersection, and set difference are introduced in the standard 
way; the first two of these are commutative, associative, and mutually distributive. In a 
system {Ma} of any cardinality, the de Morgan relations, 


X\UMa = (\(X \ Ma) and X\ (Ma = U(X\ Ma), 


are valid. Another elementary property is the following: for any family {My}, which is at 
most countable, there is a disjoint family {N,} of the same cardinality such that Np C 
M, and U, Nn =U, Mn. The set (M\ N)U(N \ M) is called the symmetric difference 
of the sets M,N and denoted as M AN. It is commutative, MAN = N AM, and 
furthermore, we have MAN =(MUN)\(MNN) and MAN = (X\M)A(X\N) for any 
X > MUN. The symmetric difference is also associative, MA(NAP)=(MAN)AP, 
and distributive with respect to the intersection, (MAN)NP=(MNP)A(NNP). 

A family R is called a set ringif MAN ER and MON ER holds for any pair 
M,N ER. The relation M\ N = (MAN)MM also gives M\ N € R, and this in 
turn implies 9 € R and MUN ER. If the symmetric difference and intersection are 
understood as a sum and product, respectively, then a set ring is a ring in the sense of the 
general algebraic definition of Appendix B.1. 


A.1.1 Example: Let 7% be the family of all bounded intervals in IR¢, d > 1. The family 
R? , which consists of all finite unions of intervals J Cc 7% together with the empty set, is 
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a set ring, and moreover, it is the smallest set ring containing 7 ¢_ As mentioned above, 
any Re R¢ can be expressed as a finite union of disjoint bounded intervals. 


A set ring R C 2% is called a set field if it contains the set X (notice that in the 
terminology of Appendix B.1 it is a ring with a unit element but not an algebra). A 
set field A C 2% is called a o-field if US, Mn € A holds for any countable system 
{M,,}22, C A. De Morgan relations show that also 122, Mn € A, and futhermore that 
a family A Cc 2* containing the set X is a o-field iff X\M eA forall M eA and 
Un Mn € A for any at most countable subsystem {M,} C A. Given a family 5 C gx 
we consider all o-fields A C 2* containing S (there is at least one, A = 2* ). Their 
intersection is again a o-field containing S ; we call it the o-field generated by S and 
denote it as A(S). 


A.1.2 Example: The elements of B¢ := A(J*) are called Borel setsin IR?. In particular, 
all the open and closed sets, and thus also the compact sets, are Borel. The o-field B? is 
also generated by other systems, e.g., by the system of all open sets in R?. In general, 
Borel sets in a topological space (X,7) are defined as the elements of the o-field A(T). 


A sequence {M,}?2, is nondecreasing or nonincreasing if Mn C Mn4i or Mn D 
Mn+1, respectively, holds for n = 1,2,.... A set family M is monotonic if it contains the 
set U,, Mn together with any nondecreasing sequence {M,}, and (),, Mn together with 
any nonincreasing sequence {M,,}. Any o-field represents an example of a monotonic 
system. To any S there is the smallest monotonic system M(S) containing S and 
we have M(S) Cc A(S). If R is a ring, the same is true for M(R) ; in addition, if 
Umer M € M(R), then M(R) is a o-field and M(R) = A(R). 

A mapping (or map) f from a set X to Y is a rule, which associates with any 
x € X a unique element y = f(x) of the set Y ; we write f : X — Y and also 
xr f(x). If Y = R or Y = @ the map f is usually called a real or a complex 
function, respectively. It is also often useful to consider maps which are defined on a 
subset D; C X only. The symbol f : X — Y must then be completed by specifying the 
set Dy which is called the domain of f ; we denote it alsoas D(f). If Dy is not specified, 
it is supposed to coincide with X. The sets Ranf:={y¢Y: y= f(r), x2 € Dy} and 
Ker f := {xz € Dy: f(x) =0} are the range and kernel of the map f , respectively. 

A map f:X —Y is injective if f(a) = f(x’) holds for any 2,2’ € X only if r=2' ; 
it is surjective if Ran f = Y. A map which is simultaneously injective and surjective is 
called bijective or a bijection. The sets X and Y have the same cardinality if there is 
a bijection f : X + Y with Dy = X. The relation f = g between f : X — Y and 
g:X —Y means by definition Dy = Dy and f(x) = g(x) for all x € Dy. If Dy D Dg 
and f(x) = g(x) holds for all « € Dg we say that f is an extension of g while g is a 
restriction of f to the set Dg ; we write f Dg and g=f}D,. 


A.1.3 Example: For any X C M we define a real function yy : xu(z) = 1 if x € 
M, xm(z) =Oif x € X \ M ; it is called the characteristic (or indicator) function of the 
set M. The map M +> yw is a bijection of the system 2* to the set of all functions 
f:X — R such that Ran f = {0,1}. 


The set f(-)(N) := {xe Dy: f(x) € N} forgiven f :X + Y and N CY iscalled 
the pull-back of the set N by the map f. One has f{" (Uae, Na) = User f'""(Na) for 
any family {Na} C Y , and the analogous relation is valid for intersections. Furthermore, 
fDCMA\N2) = FOP (ND)\ FOP (Ne) and f(f-)(N) = Ran fAN. On the other hand, 
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f‘-)(f(M)) D M ; the inclusion turns to identity if f is injective. 


A.1.4 Example: Let f: X + Y with Ds; =X. To ao-field BC 2 we can construct 
the family f(-) := { f(-D(N): N € B}, which | is obviously again a o-field. Similarly, if 
Aue 2%..igva qanael then the same is true for {N CY: f‘-)(N) € A}. Hence for any 
family S C 2Y we can construct the o-field f(-(A(S)) C 2* and the latter coincides 
with the o-field generated by f'-!)(S), ie. fO-)(A(S)) = A(fO-(S)). 


Given f: X —Y and g: Y — Z we can define the composite map gof : X + Z 
with the domain D(gof) := f‘-))(D,) = f(-)(Dyn Ran f) by (gof)(x) := 9(f(x)). We 
have (gof)(P)-) = f- Doe (py), for any PC Z. 

If f : X —Y is injective, then for any y € Ranf there is just one z, € Dy such 
that y = f(zy) ; the prescription y ++ g(y) := zy defines a map g: Y — X, which is 
called the inverse of f and denoted as f—!. We have D(f~!) = Ranf, Ran f-! = Dy 
and f—!(f(x)) =z, f(f71(y)) =y for any x € Dy and y € Ran f, respectively. These 
relations further imply f(-)(N) = f-!(N) for any N C Ranf. Often we have a pair of 
mappings f: X — Y and g: Y +X and we want to know whether f is invertible and 
f-! =q; this is true if one of the following conditions is valid: 


(i) Dg =Ranf and g(f(x)) == for all xe Dy, 
(ii) Ranf Cc D,, g(f(z)) =< for all e € Dy and Rang C Dy, f(g(y)) = y for all 
ye Dy. 

If f : X + Y is injective, then f—! is also injective and (f-!)-! =f. If g: Y 3 Z 
is also injective, then the composite map gof is invertible and (gof)~! = f-tog-!. 

The Cartesian product M x N is the set of ordered pairs [x,y] with 2 € M and 
y € N ; the Cartesian product of the families S and S’ is defined by SxS':={MxN: 
Mé€S,N €S’'}. For instance, the systems of bounded intervals of Example 1 satisfy 
gen = I"™x IJ”. If Mx N is empty, then either M = @ or N = @. On the other 
hand, if M x N is nonempty, then the inclusion M x NC Px R implies M Cc P and 
NCR. We have (MUP)xN=(MxN)U(PXN) and similar simple relations for the 
intersection and set difference. Notice, however, that (M x N)U(P x R) can be expressed 
in the form S xT only if M=P or N=R. 

The definition of the Cartesian product extends easily to any finite family of sets. 
Alternatively, we can interpret M,x---x My as the set of maps f: {1,...,n} > Uji. M; 
such that f(j) € M;. This allows us to define XgeyMa for a system {Ma : a € I} 
of any cardinality as the set of maps f : I — Uge; Ma which fulfil f(a) € Ma for any 
a €I. The existence of such maps is related to the axiom of choice (see below). 

Given f : X +> @ and g: Y — (, we define the function f x g on X x Y by 
(f x g)(x,y) := f(x)g(y). Let MCX xY ; then to any x € X we define the x-cut of 
the set M by Mz :={yeEY: [x,y] © M}; we define the y-cuts analogously. 

Let A c 2%, Bc 2” be o-fields; then the o-field A(A x B) is called the direct 
product of the fields A and B and is denoted as A@B. 


A.1.5 Example: The Borel sets in JR” and JR” in this way generate all Borel sets in 
IR™™ | i.e., we have B"™ @ BY = B™™ 


On the other hand, the cuts of a set M € A@B belong to the original fields: we have 
M, € B and M, € A for any xe X and y EY, respectively. 

A subset Ry C X x X defines a relation y on X : if [x,y] € Ry we say the element 
x is in relation with y and write yy. A common example is an equivalence, which is 
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arelation ~ on X that is reflexive (2 ~ x for any x € X), symmetric (r~y implies 
y ~ x), and transitive (a ~y and y~z imply x ~z). For any x € X we define the 
equivalence class of x as the set T, := {ye X:y~a}. We have T, =T, iff r~y, 
so the set X decomposes into a disjoint union of the equivalence classes. 

Another important example is a partial ordering on X , which means any relation < 
that is reflexive, transitive, and antisymmetric, i.e., such that the conditions z < y and 
y <a imply z= y. If X is partially ordered, then a subset M C X is said to be (fully) 
ordered if any elements x,y € M satisfy either x < y or > y. Anelement 7 € X is 
an upper bound of a set M C X if y <2 holds for all y € M ; it is a maximal element 
of M if for any yé€ M thecondition y > x implies y=2z. 


A.1.6 Theorem (Zorn’s lemma): Let any ordered subset of a partially ordered set X 
have an upper bound; then X contains a maximal element. 


Zorn’s lemma is equivalent to the so-called aziom of choice, which postulates for a 
system {M,: a € I} of any cardinality the existence of a map @ ++ Xq such that 
La € Ma for all a € I — see, e.g., [DS 1], Sec.1.2, [Ku], Sec.I.6. Notice that the 
maximal element in a partially ordered set is generally far from unique. 


A.2 Measures and measurable functions 


Let us have a pair (X,.A), where X isa set and AC 2* ag-field. A function f: X +R 
is called measurable (with respect to A) if f'-(J) € A holds for any bounded interval 
Jc mR, ie. f'-(7) C A. This is equivalent to any of the following statements: 
(i) f'-((e,00)) € A for all ce M, (ii) fi-(G) € A for any open G C R, (iii) 
f°-)(B) CA. If X is a topological space, a function f : X > R is called Borel if it is 
measurable w.r.t. the o—-field B of Borel sets in X . 


A.2.1 Example: Any continuous function f : IR? — RR is Borel. Furthermore, let 
f: X — IR be measurable (w.r.t. some A) and g: R— R be Borel; then the composite 
function gof is measurable w.r.t. A. 


If functions f,g : X — IR are measurable, then the same is true for their linear 
combinations af +6g and product fg as well as for the function r++ (f(z))~! provided 
f(z) £0 for all x € X. Even if the last condition is not valid, the function h, defined 
by h(x) := (f(x))~! if f(x) #0 and A(x) := 0 otherwise, is measurable. Furthermore, 
if a sequence {f,} converges pointwise, then the function z +> limpoo fn(z) is again 
measurable. 

The notion of measurability extends to complex functions: a function y: X - € 
is measurable (w.r.t. A) if the functions Rey(-) and Img(-) are measurable; this is 
true iff y'-)(G) € A holds for any open set G C @. A complex linear combination of 
measurable functions is again measurable. Furthermore, if y is measurable, then |y(-)| 
is also measurable. In particular, the modulus of a measurable f : X — JR is measurable, 
as are the functions f*:= 3(|f|+ f). 

A function p: X — C is simple (o—simple) if p =, ynxM, , Where yn € C and the 
sets M, € A form a finite (respectively, at most countable) disjoint system with U, Mn = 
X. By definition, any such function is measurable; the sets of (o—)simple functions are 
closed with respect to the pointwise defined operations of summation, multiplication, and 
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scalar multiplication. The expression y = >, nym, is not unique, however, unless the 
numbers y, are mutually different. 


A.2.2 Proposition: A function f : X — R is measurable iff there is a sequence {fn} 
of o-simple functions, which converges to f uniformly on X. If f is bounded, there is 
a sequence of simple functions with the stated property. 


In fact, the approximating sequence {f,} can be chosen even to be nondecreasing. If f 
is not bounded it can still be aproximated pointwise by a sequence of simple functions, 
but not uniformly. 

Given (X,A) and (Y,B) we can construct the pair (X x Y, A@B). Let yp: M—C€ 
be a function on M € A@B ; then its x-cut is the function y, defined on Mz by 
Pr(y) := p(x, y) ; we define the y-cut similarly. 


A.2.3 Proposition: With the above notation, let y be a measurable function; then its 
cut yz is for any z € X measurable w.r.t. B and the analogous statement is valid for 
the other cut. 


A mapping defined on a set family S and such that (M) is either non-negative 
or A(M) = +00 for any M € S is called (a non-negative) set function. It is monotonic if 
M CN implies \(M) < A(N), additive if A.M UN) = \(M)+A(N) for any pair of sets — 
such that MUN €S and MNN =9, and o-additive if the last property generalizes, 
A(Un Mn) = Sp A(Mn), to any disjoint at most countable system {M,} C S such that 
Un Mn ES. 

A set function jz, which is defined on a certain A C 2% , is c—additive, and satisfies 
u(%) = 0 is called a (non-negative) measure on X . If at least one M € A has p(M) < 00, 
then ,(%) = 0 is a consequence of the o-additivity. The triplet (X,A,,) is called a 
measure space; the sets and functions measurable w.r.t. A are in this case often specified 
as -measurable. A set M € JA is said to be p-zero if u(M) = 0, a proposition—valued 
function defined on M € A is valid ~—-almost everywhere if the set N C M, on which it 
is not valid, is ~-zero. A measure p is complete if N C M implies N € A for any p-zero 
set M ; below we shall show that any measure can be extended in a standard way to a 
complete one. 

Additivity implies that any measure is monotonic, and u(M UN) = u(M) + w(N) - 
u(MON) < uM) + uw(N) for any sets M,N € A which satisfy u(MMN) < oo. 
Using the o—additivity, one can check that limpoo u(Me) = uw (URL, Mn) holds for any 
nondecreasing sequence {M,} C A, and a similar relation with the union replaced by 
intersection is valid for nonincreasing sequences. A measure p is said be finite if u(X) < 
oo and o-finite if X =UP?2, Mn, where M, € A and u(Mn) < oo for n=1,2,.... 

Let (X,7) be a topological space, in which any open set can be expressed as a 
countable union of compact sets (as, for instance, the space JR@ ; recall that an open 
ball there is a countable union of closed balls). Suppose that y is a measure on X with 
the domain A > 7; then the following is true: if any point of an open set G has a p-zero 
neighborhood, then p(G) = 0. 

Given a measure yz we can define the function 9, : Ax A —+ [0,00) by o,(MxN) := 
u(M AN). The condition u(M A N) =0 defines an equivalence relation on A and 0, 
is a metric on the corresponding set of equivalence classes. 

A point x € X such that the one-point set {x} belongs to A and p({x}) 4 0 is 
called a discrete point of us ; the set of all such points is denoted as P,,. If us is o—finite the 
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set P,, is at most countable. A measure yp is discrete if P, € A and u(M) = w(MNP,) 
forany MEA. 

A measure p is said to be concentrated on a set SE A if u(M) = u(MOS) for any 
M € <A. For instance, a discrete measure is concentrated on the set of its discrete points. 
If (X,r) is a topological space and 7 C A, then the support of 4 denoted as suppy is 
the smallest closed set on which yz is concentrated. 

Next we are going to discuss some ways in which measures can be constructed. First 
we shall describe a construction, which starts from a given non-negative o—additive set 
function f defined on a ring R C 2* ; we assume that there exists an at most countable 
disjoint system {Bn} C R such that U, Bh = X and (Bn) < oo for n=1,2,.... 

Let S be the system of all at most countable unions of the elements of FR ; it 
is closed w.r.t. countable unions and finite intersections, and M \ R € S holds for 
all M € S and Re R. Any M é€ S can be expressed as M = LU; R;, where 
{R;} C R is an at most countable disjoint system; using it we can define the set func- 
tion ji on S by ji(M) := 5°; 4(R;). It is monotonic and o-additive. Furthermore, 
we have ji(U, Mn) < Sn ii(Mn) ; this property is called countable semiadditivity. To- 
gether with the monotonicity, it is equivalent to the fact that M C Use, Mx implies 
iM) < D1 (Mp). 

The next step is to extend the function ji to the whole system 2* by defining the 
outer measure by p*(A) := inf{ ji(M): M €S, M > A}. The outer measure is again 
monotonic and countably semiadditive; however, it is not additive so it is not a measure. 
Its importance lies in the fact that the system 


Ay = {ACX: jnf ut(A A M) =0} 


is a o-field. This finally allows us to define yw := y*}A, ; it is a complete o—-additive 
measure on the o-field A, > A(R), which is determined uniquely by the set function jf in 
the sense that any measure v on A(R), which is an extension of ji, satisfies vy = 4} A(R). 
The measure yp is called the Lebesgue extension of i. 

A measure pz on a topological space (X,7) is called Borel if it is defined on B = A(r) 
and u(C’) < co holds for any compact set C. We are particularly interested in Borel 
measures on JR¢, where the last condition is equivalent to the requirement (K) < oo for 
any compact interval K Cc IR4. 

Any Borel measure on IR? is therefore c-additive and corresponds to a unique o— 
additive set function 2 on R?. The space IR*?, however, has the special property that 
for any bounded interval J € 7? we can find a nonincreasing sequence of open intervals 
I, > J and a nondecreasing sequence of compact intervals K, C J such that (1), In = 
Un Kn = J. This allows us to replace the requirement of o-additivity by the condition 
A(J) = inf{ (1): 1 € Gy} =sup{ a(K): K € Fy} for any J € J4, where Gy c J? is 
the system of all open intervals containing J, and Fj C 7@ is the system of all compact 
intervals contained in J. A set function ji on J? which is finite, additive, and fulfils the 
last condition is called regular. 


A.2.4 Theorem: There is a one-to-one correspondence between regular set functions / 


and wz := u*} B* on R¢. In particular, Borel measures w and v coincide if u(J) = v(J) 
holds for any J € J?. 


A.2.5 Example: Let f : JR — IR be a nondecreasing right-continuous function. For any 
a,be R,a<b, weset fis(a,b) := f(b—0)— f(a), f(a, b] = f(b) — f(a), and analogous 
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expressions for the intervals [a,b] and [a,b]. This defines the regular set function jis 
on J! ; the corresponding Borel measure Hy is called the Lebesgue-Stieltjes measure 
generated by the function f. In particular, if f is the identical function, f(z) = 2, 
we speak about the Lebesgue measure on JR. Let us remark that the Lebesgue-Stieltjes 
measure is sometimes understood as a Lebesgue extension with a domain which is generally 
dependent on f ; however, it contains B in any case. 


A.2.6 Example: Let ji and D be regular set functions on 7™ C IR™ and J" Cc AR”, 
respectively; then the function 6 on 7™*” defined by 6(J x L) := fi(J)H(L) is again 
regular; the corresponding Borel measure is called the direct product of the measures pu 
and v which correspond to / and 7, respectively, and is denoted as 1@v. In particular, 
repeating the procedure d times, we can in this way construct the Lebesgue measure on 
IR? which associates its volume with every parallelepiped. 


A.2.7 Proposition: Any Borel measure on JIR® is regular, i.e., u(B) = inf{ u(G): GD 
B, G open } = sup{ u(C): C Cc B, C compact }. 


As a consequence of this result, we can find to any B € B% a nonincreasing sequence 
of open sets Gn > B and a nondecreasing sequence of compact sets Cn C B (both 
dependent generally on the measure yz) such that u(B) = limp U(Gn) = U(Nn Gn) = 
limnooL(Cr) = u(U, Cn). Proposition 7 generalizes to Borel measures on a locally 
compact Hausdorff space, in which any open set is a countable union of compact sets — 
see [Ru 1], Sec.2.18. 

Let us finally remark that there are alternative ways to construct Borel measures. 
One can use, e.g., the Riesz representation theorem, according to which Borel measures 
correspond bijectively to positive linear functionals on the vector space of continuous 
functions with a compact support — cf. [Ru 1], Sec.2.14; [RS 1], Sec.IV.4. 


A.3 Integration 


Now we shall briefly review the Lebesgue integral theory on a measure space (X, A, y). 
It is useful from the beginning to consider functions which may assume infinite values; 
this requires to define the algebraic operations a+ oo := 00, a@:00:= 00 for a > 0 and 
a:oo := 0 for a = 0, etc., to add the requirement f‘-")(00) € A to the definition of 
measurability, and several other simple modifications. 

Given a simple non-negative function s := >>, ynxm, on X, we define its integral 
by fy $d := Yn Ynt(Mn) ; correctness of the definition follows from the additivity of . 
In the next step, we extend it to all measurable functions f : X — [0,00] putting 


[ tau = [ se)dula) i= sup{ f sau: ses;}, 


where Sy is the set of all simple functions s : X — [0,00) such that s < f. We also 
define fy f du:= fy fxm dp for any M € A; in this way we associate with the function 
f and the set M anumber from (0, co], which is called the (Lebesgue) integral of f over 
M w.r.t. the measure p. 

A.3.1 Proposition: Let f,g be measurable functions X — [0,co] and M € A; then 
Sulkf) du =k fy fd holds for any k € [0,co) , and moreover, the inequality f < g 
implies fy fdu < fu gdu. 
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Notice that the integral of f =0 is zero even if 4(X) = oo. On the other hand, the 
relation fy ||du = 0 for any measurable function y : X — € implies that w{rex: 
v(x) # 0}) =0, ie., that the function y is zero y-a.e. Let us turn to limits which play 
the central role in the theory of integration. 


A.3.2 Theorem (monotone convergence): Let {fn} be a nondecreasing sequence of non- 
negative measurable functions; then limp—oo fy fn du = fy (liMpoo fn) du. 


The right side of the last relation makes sense since the limit function is measurable. 
However, we often need some conditions under which both sides are finite. The correspond- 
ing modification is also called the monotone-convergence (or Levi's) theorem: if {fn} 
is a nondecreasing sequence of non-negative measurable functions and there is a k > 0 
such that fy fadu<k for n=1,2,..., then the function r++ f(x) := limo fn(Z) is 
p-ae. finite and limpoo fy frdu = fy fd <k. The monotone-convergence theorem 
implies, in particular, that the integral of a measurable function can be approximated by 
a nondecreasing sequence of integrals of simple functions. 


A.3.3 Corollary (Fatou’s lemma): fy(liminf noofn)du < liminf poo fy fradp holds 
for any sequence of measurable functions f, : X — [0, oo]. 


This result has the following easy consequence: let a sequence {fn} of non-negative 
measurable functions have a limit everywhere, limp—oo fn(z) = f(z), and fy fadu <k 
for n= 1,2,...; then fy fdu <k. Applying the monotone—convergence theorem to a se- 
quence {f,} of non-negative measurable functions we get the relation fy ()op=1 fn) du = 
Dr Sx fndu. In particular, if f : X — [0,00] is measurable and {Mn}?%2, C A 
is a disjoint family with UR2, Mn = M, then putting f, := fxm, we get fy fdp = 
Ynez Ju, f du. This relation is called o-additivity of the integral; it expresses the fact 
that the function f together with the measure yz generates another measure. 


A.3.4 Proposition: Let f : X — [0,00] be a measurable function; then the map 
M+ v(M) := fy f du is a measure with the domain A, and fy gdv = fy gf du holds 
for any measurable g : X — [0,00]. 


Let us pass to integration of complex functions. A measurable function y: X - € is 
integrable (over X w.r.t. w) if fy |p| du < oo (recall that if y is measurable so is || ). 
The set of all integrable functions is denoted as £(X,dj) ; in the same way we define 
L(M,dp) for any M € A. Given y € L(X,du) we denote f := Rey and g :=Imy; 
then f* and g* are non-negative measurable functions belonging to £(X,du). This 
allows us to define the integral of complex functions through the positive and negative 
parts of the functions f,g as the mapping 


_ dui= ff stau—f pra if otdu-if ona 
yp [ed it ae me) pis De UC 


of £L(X,du) to C. If, in particular, w is the Lebesgue measure on JR? we often use 
the symbol C(IR¢) or C(IR%,dx) instead of C(IR*,d), and the integral is written as 
J p(x) dx , or occasionally as f (#) dz. 

The above definition has the following easy consequence: if fy, dp =0 holds for all 
MeA, then p(z) = 0 p-ae. in X. Similarly fy, du > 0 for all M € A implies 
v(x) > 0 p-ae. in X ; further generalizations can be found in { Ru 1], Sec.1.40. The 
integral has the following basic properties: 
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(a) linearity: L(X,du) is a complex vector space and f,(ap + v)du = a fy pdp-+ 
Sx wdp for all vy, € L(X,du) and aed, 


(b) [fx edu| < fy |p| dp holds for any yp € L(X, dy). 


A.3.5 Examples: A simple complex function o = ,mxm, on X is integrable iff 
Yen |Mn| (Mn) < oo, and in this case fy odu = >, mu(Mn). The same is true for o— 
simple functions. Further, let functions f : X — [0,00] and y: X + C@ be measurable 
and dv := fdw; then y € L(X,dv) iff yf € L(X,du) and Proposition 4 holds again 
with g replaced by ¢. 


_ For a finite measure, we have an equivalent definition based on approximation of 
integrable functions by sequences of o-simple functions. 


A.3.6 Proposition: If (X) < oo, then a measurable function y : X — € belongs 
to £(X,du) iff there is a sequence {7,} of o-simple integrable functions such that 
limn—oo(suPzex |~(Z)—Tn(x)|) = 0 ; if this is the case, then fy pdu = limpo fy mdp. 
Moreover, if ~ is bounded the assertion is valid with simple functions 7, . 


One of the most useful tools in the theory of integral is the following theorem. 


A.3.7 Theorem (dominated convergence, or Lebesgue): Let M € A and {yn} be 
a sequence of complex measurable functions with the following properties: (x) := 
liMpoo Yn(x) exists for y-almost all x € M and there is a function w € L(X, dp) 
such that |Y~p(x)| < (x) holds w-a.e. in M for n=1,2,.... Then y € £L(X, du) and 


Jim, [ le- eld =0, jim [end = [ edu. 


Suppose that we have non-negative measures ys and v on X (without loss of gener- 
ality, we may assume that they have the same domain) and k > 0; then we can define the 
non-negative measure » := ku+v. We obviously have £(X, dX) = L(X, du) N L(X, dv) 
and fy wd\=k fy pdut fy wdv for any y € L(X,dA). In particular, if non-negative 
measures ps and A with thesame domain A satisfy u(M) < A(M) for any M € A, then 
L£(X, dr) C L(X, du) and fy fd < fy fdX holds for each non-negative f € £(X,dX). 

Let ps, v again be measures on X with the same domain A. We say that v is 
absolutely continuous w.r.t. p and write v < yw if w(M) = 0 implies v(M) = 0 for 
any M € A. On the other hand, if there are disjoint sets S,, S, € A such that py is 
concentrated on S, and yv on S, we say that the measures are mutually singular and 
write tv. 


A.3.8 Theorem: Let ’ and p be non-negative measures on A, the former being finite 
and the latter o—finite; then there is a unique decomposition \ = Ag. + A, into the sum 
of non-negative mutually singular measures such that Ag. < w and A, L 4. Moreover, 
there is a non-negative function f € £(X,du), unique up to a zero measure subset of 
X, such that ddoc = f dp, t.€., Aoc(M) = fay f du for any MEA. 


The relation \ = Aa + Az is called the Lebesgue decomposition of the measure A. 
The second assertion implies the Radon-Nikodym theorem: let y, be o-finite and A finite; 
then \ < yu holds iff there is f € £(X,du) such that d\ = f dy. 
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A.3.9 Remark: There is a close connection between these results (and their extensions 
to complex measures mentioned in the next section) and the theory of the indefinite 
Lebesgue integral, properties of absolutely continuous functions, etc. We refer to the 
literature mentioned at the beginning; in this book, in fact, we need only the following 
facts: a function y : IR > C€ is absolutely continuous on a compact interval (a, }] if 
for any € > 0 there is 6 > 0 such that 3; |9(G;) — y(a;)| < € holds for a finite 
disjoint system of intervals (a;,8;) C [a,6| fulfilling 5°;(G; - aj) < 6. The function 
y is absolutely continuous in a (noncompact) interval J if it is absolutely continuous 
in any compact [a,b] C J. A function y: IR > C is absolutely continous in R iff 
its derivative y’ exists almost everywhere w.r.t. the Lebesgue measure and belongs to 
L(J,dx) for any bounded interval J C JR with the endpoints a < 0; in such a case we 
have (b) — (a) = {2 ¢'(a) de. 

Next we shall mention integration of composite functions. Let w: X — IR? be a 
map such that w‘-!)(B¢) Cc A; this requirement is equivalent to measurability of the 
“component” functions w;: X > R,1<j<d. Suppose that p(w!) (J)) < co holds 
for any J € J¢; then the relation Br p(B) := y(w'-")(B)) defines a Borel measure 
u™ on IR?. 


A.3.10 Theorem: Adopt the above assumptions, and let a Borel function y: R? + © 
belong to C(UIR*, du) ; then gow € L(X,du) and 


/ yd) = / (pow) dy 
B w'-1)(B) 


holds for all b € Bt. 


In particular, if X = IR¢, A = B*, and yu”) is the Lebesgue measure on R?¢, the latter 
formula can under additional assumptions be brought into a convenient form. Suppose 
that w : IR? — IR® is injective, its domain is an open set D C IR?%, the component 
functions w; :D — IR have continuous partial derivatives, (O,w,)(-) for j,k =1,...,d, 
and finally, the Jacobian determinant, D, := det(O,w;), is nonzero a.e. in D. Such a 
map is called regular; its range R := Ranw is an open set in JR?, and the inverse w7! 
is again regular. 


A.3.11 Theorem (change of variables): Let w be a regular map on JR? with the domain 
D and range FR ; then a Borel function y: R + € belongs to L(R, dz) iff (pow)Dy € 
L(D, dz) ; in that case we have 


[i vtwrde = J ((eou)|Dul)(2) de 


for any Borel BCR. 


In Example A.2.6 we have mentioned how the measure @v can be associated with 
a pair of Borel z,v on JR¢. An analogous result is valid under much more general 
circumstances. 


A.3.12 Theorem: Let (X,A,,) and (Y,B,v) be measure spaces with o-finite measures; 
then there is just one measure A on X x Y with the domain A@B such that \(A x B) 
= u(A)v(B) holds for all A € A, B € B; this measure is o-finite and satisfies \(M) = 
Sx v(Mz) du(x) = fy u(M") dv(y) for any ME AQ@B. 
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The measure 2 is again called the product measure of » and v and is denoted as 
u@v. Using it we can formulate the following important result. 


A.3.13 Theorem (Fubini): Suppose the assumptions of the previous theorem are valid 
and y: X x Y —@ belongs to L(X x Y,d(u@v)). Then the cut gy, € L(Y,dv) for 
H-a.a. x € M and the function ® : &(xz) = fy prdy belongs to C(X,dy) ; similarly 
pY € L(X,du) v-ae. in N and UW: W(y) = fy yp’ dw belongs to L(Y,dv). Finally, we 
have fy Odu= fy Vdv = fy,, yd(u @v), or in a more explicit form, 


[(f e@emdew)) ducz) = [ (feu) dule)) doy) = foley) due ry a) 


We should keep in mind that the latter identity may not be valid if at least one of the 
measures y and v is not o-finite. It is also not sufficient that both double integrals exists 
— counterexamples can be found, e.g., in [Ru 1], Sec.7.9 or [KF], Sec.V.6.3. However, 
if at least one double integral of the modulus |y| is finite, then all the conclusions of the 
theorem are valid. 


A.4 Complex measures 


A o-additive map vy: A— C corresponding to a given (X,.A) is called complex measure 
on X ;if v(M) € R for all m € A we speak about a real (or signed) measure. Any pair 
of non-negative measures 41, {42 with acommon domain A determines a signed measure 
by 9 := p41 — pa ; similarly a pair of real measures 91, 02 defines a complex measure by 
V:= 01 +100. 

Any at most countable system {M;}, which is disjoint and satisfies M =U, M;, 
will be called decomposition of the set M,; the family of all decompositions of M will 
be denoted by Sy. To a given complex measure vy and M € A, we define |v|(M) := 
sup { D; \v(M;)| : {Mj} € Su}. One has |y|(M) > |v(M)| ; the set function |v|(-) is 
called the (total) variation of the measure v. 


A.4.1 Proposition: The variation of a complex measure is a non-negative measure; it 
is the smallest non-negative measure such that u(M) > |v(M)| holds for all M € A. 

Using the total variation, we can decompose in particular any signed measure g in 
the form 9 = pt — p-, where p* : p*(M) = $[lo\(M) + o(M)]. Since in general the 
decomposition of a signed measure into a difference of non-negative measures is not unique, 
one is interested in the minimal decomposition 9 = ut - ug such that any pair of non- 
negative measures 41, {42 on A with the property 9 = 41 — {2 satisfies ,(M) > wt(M) 
and p2(M) > u5(M) foreach MEA. 

The minimality is ensured if there is a disjoint decomposition Qt UQ~ = X such that 
De := t0(MNQ*) > 0; the pair {Q+,Q } is called Hahn decomposition of X w.r.t. 
the measure @. The Hehe decomposition always exists but it is not unique. Nevertheless, 
if {Q+,Q-} is another natn decomposition, one has 9(M  Q*) = o(MN@Q#*) for any 
M € A, so the measures pe depend on 9 nai we call them the positive and negative 
variation of the measure g. The formula 9 = Th —p> is named the Jordan decomposition 
of the measure @. 
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One has u3(M)+u5(M) = |o|(M) and us (M) = sup{ +0(A): ACM, AE A} for 
any M © A. As a consequence, the positive and negative variations of a signed measure 
as well as the total variation of a complex measure are finite. One can introduce also 
infinite signed measures; however, we shall not need them in this book. 

Complex Borel measures on IR? have A = B% for the domain. Variation of a complex 
Borel measure is a non-negative Borel measure. As in the non-negative case, a complex 
Borel measure can be approximated using monotonic sequences of compact sets from inside 
and open sets from outside of a given M € B*. Also the second part of Theorem A.2.4 
can be generalized. 


A.4.2 Proposition: Let complex Borel measure y and / on R? satisfy v(J) = o(J) 
for all Je 74; then v=v. 


Before proceeding further, let us mention how the notion of absolute continuity extends 
to complex measures. The definition is the same: a complex measure v is absolutely 
continuous w.r.t. a non-negative uw if u(M) = 0 implies v(A7) = 0 for all M € A. 
There is an alternative definition. 


A.4.3 Proposition: A complex measure v satisfies v < p iff for any ¢ > 0 there is a 
6 >0 such that u(M) <6 implies |y(M)| <e. 


In particular, if y € £(X,du) and v is the measure generated by this function, v(M/) := 
Jueadu, then v < p, so for any € > O there is a 6 > 0 such that p(M) < 6 implies 
fu Pau| <e€; this property is called absolute continuity of the integral. 

Theorem A.3.8 holds for a complex measure » as well. The measure can be even o- 
finite; however, then the function f belongs no longer to L(X,du), it is only measurable 
and integrable over any set M € A with »(M) < oc. The Radon-Nikodym theorem 
yields the polar decomposition of a compler measure: 


A.4.4 Proposition: For any complex measure vy there is a measurable function h such 
that |h(x)| =1 for all re X and dvy=hd|v|. 


Let us pass now to integration with respect to complex measures. We start with a 
signed measure 9 = ut — yz : a function y: X — C is integrable w.r.t. @ if it belongs 
to L(X, dus) L(X, du, ) =: C(X,dg) . Its integral is then defined by 


J de = f vant - [ pus ; 
> ¢ Xx »¢ 


the correctness follows from the uniqueness of the Jordan decomposition. The integral 
w.r.t. a complex measure v represents then a natural extension of the present definition: 
for any function y : X + € belonging to L(X,dv) := L(X,dRev) 1 L(X,dImv) we set 


[vw = f pdRev + [ ydImr. 
x Xx x 


The set of integrable functions can be expressed alternatively as L(X,dv) = L(X,d|v}) ; it 
is a complex vector space and the map y+» fy ydy is again linear. Also other properties 
of the integral discussed in the previous section extend to the complex-measure case. For 
instance, the inequality |fy pdv| < fy |y|dlv| holds for any y € L(X,du). We shall 
not continue the list, restricting ourselves by quoting the appropriate generalization of 
Proposition A.3.4. 
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A.4.5 Proposition: Let y € £(X,dv) for a complex measure v ; then the map M ++ 
(M) := fy, edu defines a complex measure y on A. The conditions y € £(X,dy) and 
wp € L(X,dv) are equivalent for any measurable 7: X — C ; if they are satisfied one 


has fy Ydy= fy vpdv. 


A.5 The Bochner integral 


The theory of integration recalled above can be extended to vector-valued functions F : 
Z — &X, where 4 is a Banach space; they form a vector space denoted as V(Z, +) 
when equipped with pointwise defined algebraic operations. Let (Z,A,) be a measure 
space with a positive measure y. A function S € V(Z, 4) is simple if there is a disjoint 
decomposition {M;}7_, C A of the set Z and vectors y1,...,¥n € ¥ such that S = 
j=1 ¥iXM,; - The integral of such a function is defined by fz S(t) du(t) := hai yju(My) ; 
as above, it does not depend on the used representation of the function S. 

To any F € V(Z,du) we define the non-negative function ||F'|| := ||F(-)||. A vector— 
valued function F’ is integrable w.r.t. yu if there is a sequence {S,} of simple functions 
such that S,(t) + F(t) holds for p-a.a. t€ Z and f; ||F — Sp|| dp — 0. The set of all 
integrable functions F : Z + 4 is denoted by B(Z,du; 7). If F is integrable, the limit 


Jf F@du(t) = jim, f Salt) du(t 


exists and it is independent of the choice of the approximating sequence; we call it the 
Bochner integral of the function F'. The function x,F is integrable for any set M eA 
and F € B(Z,du;%), so we can also define fy, F(t) du(t) := fzxm(t)F(t) du(t). If 
{M,} is a finite disjoint decomposition of M , we have 


[Fo aue = y ia F(t) du(t), 


which means that the Bochner integral is additive. 


A.5.1 Proposition: The map F + f, F(t)d(t) from the subspace B(Z,du;%) C 
V(Z,X) to 4 is linear. Suppose that for a vector-valued function F there is a sequence 
{S,} of simple functions that converges to F p-a.e.; then F belongs to B(Z, dy; ¥) iff 
[Fl] € £(Z, du) , and in that case 


| fred] < [wreondue). 


The existence of an approximating sequence of simple functions has to be checked 
for each particular case; it is easy in some situations, e.g., if Z is a compact subinterval 
in JR and F is continuous, or if Z is any interval, F' is continuous and its one-sided 
limits at the endpoints exist. The continuity of F : R — 2% in an interval [a,b] also 
implies the relation = if F(u)du = F(t) for any t € (a,b). Proposition 1 shows that 
the Bochner integral is absolutely continuous: for any € > 0 there is a 6 > 0 such that 
fy F(t) du(t)|| < € holds for any N € A with u(N) <6. Another useful result is the 
following. 
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A.5.2 Proposition: If B: 4% — ) is a bounded linear map to a Banach space JY ; then 
the condition F € B(Z,du;%) implies BF € B(Z,dy;y), and 


[ero dus = Bf Faun) . 


Many properties of the Lebesgue integral can be extended to the Bochner integral. 
Probably the most important among them is the dominated—convergence theorem. 


A.5.3 Theorem: Let {F,} C B(Z,du;4) be a sequence such that {F,(t)} converges 
for p-a.a. t € Z and ||F,(t)|| < g(t), n = 1,2,..., for some g € L(Z,du). Assume 
further that there is a sequence {S,} of simple functions, which converges to the limiting 
function F: F(t) = limp. Fn(t) p-ae. in Z; then F € B(Z,dp;%) and 


dim, f/ Fa(t)au(t) = ff P(e) due). 


An analogue to Theorem A.3.10 can be proven for some classes of functions, e.g., for 
a monotonic w: R—- R. 

Since B(4’) is a Banach space, the Bochner integral is also used for operator—valued 
functions. For instance, suppose that a map B : IR — B(2) is such that the vector-valued 
function t ++ B(t)x is continuous for any x € 4. Further, let K C JR be a compact 
interval and y a Borel measure on R ; then limp fx B(t)tn du(t) = fx B(t)x du(t) 
holds for any sequence {z,} C 4 converging to a point xz. Moreover, if an operator 
T €C commutes with B(t) for all te K, then fy B(t)ydu(t) belongs to D(T) for any 
y € D(T) and T fy B(t)y du(t) = fx B(t)Tydu(t). 


B. Some algebraic notions 


In this appendix we collect some algebraic definitions and results needed in the text. There 
are again many textbooks and monographs in which this material is set out extensively; 
let us name, e.g., [BR 1], [Nai 1], [Ru 2], or [Ti] for associative algebras, and [ BaR], 
{ Kir], [Pon], or [Zel] for Lie groups and algebras. 


B.1 Involutive algebras 


A binary operation in a set M isamap py: MxM — M ; it is associative or commutative 
if p(y(a, b),c) = p(a,y(b,c)) or y(a,b) = y(b,a), respectively, holds for all a,b,c eM. 
A set G equipped with an associative binary operation is called a group if there exist the 
unit element e€G, (g,e) = y(e,g) = g for any g € G, and the inverse element 
g-' EG toany gEG, (9,97!) = p(g71g) =e. 

Consider next a set R equipped with two binary operations, which we call summation, 
Ya(a,b) = a+b, and multiplication, ~m(a,b) := ab. The triplet (R,Ya,~m) is a ring if 
(R, a) is a commutative group and the two operations are distributive, a(b+c) = ab+ac 
and (a+b)c = ac + bc for all a,b,c € R. If there is an e € R such that ae =ea =a 
holds for all a € R, we call it the unit element of R. 
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Let A be a vector space over a field F’. The vector summation gives it the structure- 
of a commutative group; if we define a multiplication which is distributive with the sum- 
mation and satisfies a(ab) = (aa)b = a(ab) for any a,b € A, a € @, then A becomes 
a ring, which we call a linear algebra over the field F,, in particular, a real or complex 
algebra if F = R or F =(, respectively. An algebra is said to be associative if its 
multiplication is associative. The term “algebra” without a further specification always 
means a complex associative algebra in what follows; we should stress, however, that many 
important algebras are nonassociative, e.g., the Lie algebras discussed in Sec.B.3 below. 
An algebra is Abelian or commutative if its multiplication is commutative. 

A subalgebra of an algebra A is a subset B, which is itself an algebra with respect 
to the same operations. If A has the unit element, which is not contained in B, then 
we can extend the subalgebra to B := {ae +b: a € €,b € B} ; in a similar way, 
any algebra can be completed with the unit element by extending it to the set of pairs 
[a,a], a € ©,a € A, with the appropriately defined operations. A proper subalgebra 
BCA is called a (two-sided) ideal in A if the products ab and ba belong to B for 
all a € A,b € B; we define the left and right ideal analogously. A trivial example of an 
ideal is the zero subalgebra {0} C A. The algebra A itself is not regarded as an ideal; 
thus no ideal can contain the unit element. A mazimal ideal in A is such that it is not a 
proper subalgebra of another ideal in A ; any ideal in an algebra with the unit element is 
a subalgebra of some maximal ideal. An algebra is called simple if it contains no nontrivial 
two-sided ideal. The intersection of any family of subalgebras (ideals, one-sided ideals) 
in A is respectively a subalgebra (ideal, one-sided ideal), while the analogous assertion 
for the unions is not valid. 

Let A be an algebra with the unit element. We say that an element a € A is invertible 
if there exists an inverse element a~1 € A such that a~'a = aa~! =e ; we define the 
left and right inverse in the same way. For any a € A there is at most one inverse; an 
element is invertible iff it belongs to no one-sided ideal of the algebra A, which means, in 
particular, that in an algebra without one-sided ideals any nonzero element is invertible. 
Recall that a field is a ring with the unit element which has the last named property; the 
examples are IR, € or the noncommutative field @ of quaternions. 

We define the spectrum of a € A as the set o4(a) := {A: (a—Ae)~! does not exist }. 
The complement p,(a) := € \ o4(a) is called the resolvent set; its elements are regular 
values for which the the resolvent ra(A) := (a—Ae)~! exists. 


B.1.1 Proposition: Let A be an algebra with the unit element; then 

(a) if a, ab are invertible, b is also invertible. If ab,ba are invertible, so are a and b, 

(b) if ab =e, the element ba is idempotent but it need not be equal to the unit element 
unless dim A < oo, 

(c) if e—ab is invertible, the same is true for e—ba, 

(d) o,(ab) \ {0} = o,4(ba) \ {0}, and moreover, o4(ab) = o.4(ba) provided one of the 
elements a,b is invertible, 

(e) o4(a~!) = {A7!:  € o4(a) } holds for any invertible ae A. 


For any set S C A we define the algebra Ag(S) generated by S as the smallest 
subalgebra in A containing S ; it is easy to see that it consists just of all polynomials 
composed of the elements of S without an absolute term. We say that S is commutative 
if ab = ba holds for any a,b € S ; the algebra Ao(S) is then Abelian. A maximal Abelian 
algebra is such that it is not a proper subalgebra of an Abelian subalgebra; any Abelian 
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subalgebra in A can be extended to a maximal Abelian subalgebra. We also define the 
commutant of a set SC A as S’:={a€ A: ab=ba, b€ S}; in particular, the center is 
the set Z := A’. We define the bicommutant S” := (S’)! and higher-order commutants 
in the same way. 


B.1.2 Proposition: Let S,7 be subsets in an algebra S ; then 

(a) S’ and S” are subalgebras containing the center Z, and also the unit element if A 
has one. Moreover, S’=S" =--- and S”=S!" =.-.-., 

(b) the inclusion S C T implies S’D 7’, 

(c) S CS”, and S is commutative iff S Cc S’, which is further equivalent to the 
condition that S” is Abelian, 

(d) Ao(S)’ = S! and Ao(S)” = S”, 

(e) a subalgebra BC A is maximal Abelian iff B=’; in that case also B = B”. 


Let us turn to algebras with an additional unary operation. Recall that an involution 
a++a* ona vector space A is an antilinear map A — A such that (a*)* = a holds for all 
a € A; an involution on an algebra is also required to satisfy the condition (ab)* = b*a* 
for any a,b € A. An algebra equipped with an involution is called an involutive algebra 
or briefly a *—algebra. A subalgebra in A, which is itself a *—algebra w.r.t. the same 
involution, is called a *-subalgebra; we define the *—ideal in the same way. The element 
a* is said to be adjoint to a. Given a subset S C A we denote S* := {a*: ae S} ; 
the set S is symmetric if S* = S ; in particular, an element a fulfilling a* = a is called 
Hermitean. By A§j(S) we denote the smallest *-subalgebra in A containing the set S. 


B.1.3 Proposition: Let A bea *—algebra; then 

(a) any element is a linear combination of two Hermitean elements, and e* = e provided 
A has the unit element, 

(b) a* is invertible iff a is invertible, and (a*)~! = (a—')*, 

(c) a4(a*) = o,4(a) holds for any ac A, 

(d) a subalgebra B C A is a *-subalgebra iff it is symmetric; the intersection of any 
family of *-subalgebras ( *-ideals) is a *~subalgebra ( *—ideal), 

(e) any *-ideal in A is two-sided, 

(f) A§(S) = Ao(S US*) holds for any subset S C A; if S is symmetric, then S’ and 
S” are *-subalgebras in A. 


B.1.4 Example (bounded-operator algebras): The set B(H) with the natural algebraic 
operations and the involution B++ B* is a *-algebra whose unit element is the operator 
I. Let us mention a few of its subalgebras: 

(a) if & is a nontrivial projection, then {EB : B € B(H)} is a right ideal but not a 
«subalgebra; on the other hand, { EBE : B € B(H)} is a *-subalgebra but not 
an ideal, 

(b) if dimH = oo, the sets K(H) D Jo(H) D> Ti(H) of compact, Hilbert-Schmidt, and 
trace-class operators, respectively, are ideals in B(H) ; similarly Jp(H) is an ideal 
in any J,(H), q>p, etc., 

(c) the algebra Ao(B) generated by an operator B € B(H) consists of all polynomials 
in B without an absolute term. It is a *-algebra if B is Hermitean, while the 
opposite implication is not valid; for instance, the Fourier-Plancherel operator F 
is non—Hermitean but Ao(F) is a *-algebra because F? = F-! = F*. 


B.2. BANACH ALGEBRAS 547 


The algebras of bounded operators, which represent our main topic of interest, inspire 
some definitions. We have already introduced the notions of spectrum and hermiticity; 
similarly an element a € A is said to be normal if aa* = a*a, a projection if a* =a =a’, 
and unitary if a* =a~!, etc. Of course, we also employ other algebras than B(H) and 
its subalgebras, e.g., the Abelian *—algebra C(M) of continuous complex functions on a 
compact space M with natural summation and multiplication, and the involution given 
by complex conjugation, (f*)(ax) := f(z). 

An ideal 7 in an algebra A is a subspace, so we can construct the factor space A/7 . 
It becomes an algebra if we define on it a multiplication by 46 := ab, where a,b are any 
elements representing the equivalence classes @ and 6; it is called the factor algebra (of 
A w.r.t. the ideal 7). If A has the unit element, then the class 6:= {e+c: cE J} is 
the unit element of A/7. 

A morphism of algebras A, B is a map y: A — B which preserves the algebraic 
structure, i.e, y(aa + 6) = ay(a) + y(b) and y(ab) = y(a)y(b) holds for all a,b € 
A, a€C. In particular, if y is surjective, then the image of the unit element (an ideal, 
maximal ideal, maximal Abelian subalgebra) in A is respectively the unit element (an 
ideal, ...) in B. If y is bijective, we call it an isomorphism; in the case A = B one 
uses the terms endomorphism and automorphism of A, respectively. The null-space of a 
morphism ¢ is the pull-back y~1(0g) of the zero element of the algebra B ; it is an ideal 
in A. If A, B are *-algebras and y preserves the involution, y(a*) = y(a)*, it is called 
*—morphism. 


B.1.5 Example: Let 7 bea (*-—)ideal in a ( *—)algebra A; then the map y,: y-(a) =G@ 
is a (*-)morphism of A to A/7Z. It is called a canonical morphism; its null-space is just 
the ideal 7. The factor algebra A/JZ is simple iff the ideal 7 is maximal. Moreover, 
any (*-)morphism y : A — B can be expressed as a composite mapping, y = To, 
where y is the canonical morphism corresponding to the (+*-)ideal 7 := y~'(0g) and 
m: A/J — p(A) is the ( *—)isomorhism defined by 7(@) := y(a) for any ac A. 


Let us finally recall a few notions concerning representations. This term usually means 
the mapping of an algebraic object onto a suitable set of operators, which preserves the 
algebraic structure. We shall most often (but not exclusively) use representations by 
bounded operators: by a representation of a (*—)algebra A we understand in this case a 
(*«-)morphism 7: A— B(H), the space # is called the representation space and dimH 
the dimension of the representation 7. If the morphism 7 is injective, the representation 
is said to be faithful. Representations 7; : A — B(H;), j = 1,2, are equivalent if there 
is a unitary operator U : H,; — He such that mo(a)U = Um(a) holds for any a € A. 
A representation 7: A — B(H) is called irreducible if the operator family (A) has no 
nontrivial closed invariant subspace. A vector x € H is cyclic for the representation 7 
if the set 1(A)x := {x(a)x : a € A} is dense in H. Representations of groups, Lie 
algebras, etc., are defined in the same way. 


B.2 Banach algebras 
Algebras can be equipped with a topological structure. Suppose that an algebra A is 


at the same time a locally convex topological space with a topology 7 ; then we call it a 
topological algebra if the multiplication is separately continuous, i.e., the maps a++ ab and 
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a‘ ba are continuous w.r.t. the topology 7 for any fixed 6€ A. A subalgebra BC A 
is closed if it is closed as a subset in A. The closed subalgebra A(S) generated by a set 
S C A is the smallest closed subalgebra in A containing S. Isomorphisms py: A — B 
of topological algebras can be classified according to their continuity: the algebras A, B 
are topologically isomorphic if there is a continuous isomorphism y such that yp! is also 
continuous. 


B.2.1 Proposition: Let A be a topological algebra and S, B its subset and subalgebra, 
respectively; then 
(a) B is a closed subalgebra in A. If B is Abelian, the same is true for B, and any 
maximal Abelian subalgebra is closed, 
(b) A(S) = An(S), 7 
(c) the subalgebras S’, S” are closed and (S)' = 5S’, 
(d) if B is an ideal, B 4 A, then B is also an ideal in A. Any maximal ideal is closed, 
(e) the null-space of a continuous morphism yp: A — C is a closed ideal in A. 


There are various ways how of defining a topology on an algebra. 


B.2.2 Example: The strong and weak operator topologies on B(H) are both locally 
convex, and the operator multiplication is separately continuous with respect to them 
(compare with Theorem 3.1.9 and Problem 3.9); thus B,(H) and By(#) are topological 
algebras. 


One of the most natural ways is to introduce a topology by means of a norm. An 
algebra A is called a normed algebra provided 


(i) A is a normed space with a norm || - ||, 
(ii) |labl| < |la|| [||| for any a,be A, 
(iii) if A has the unit element, then |le|] =1. 


The last condition may be replaced by |le|| < 1 because the opposite inequality follows 
from (ii). The multiplication in a normed algebra is jointly continuous. If A is complete 


w.r.t. the norm || - ||, it is called a Banach algebra. We can again assume without 
loss of generality that a normed algebra A has the unit element; otherwise we extend 
it in the above described way, defining the norm by ||[a, a]|| := |a| + |lajl4. If J is a 


closed ideal in a Banach algebra A, then A/JZ is a Banach algebra w.r.t. the norm 
llal| := infrez ||a—blla. 

A complete envelope of a normed algebra A is a Banach algebra B such that it 
contains A as a dense subalgebra and |la||4 = |lal|g holds for any a € A. 


B.2.3 Theorem: Any normed algebra A has a complete envelope, which is unique up 
to an isometric isomorphism preserving the elements of A. 


The space B(H) equipped with the operator norm provides an example of a Banach 
algebra; the complete envelope of any subalgebra B C B(H) is its closure B. By a 
straightforward generalization of the methods of Section 1.7, we can prove the following 
assertions. 


B.2.4 Theorem: Let A be a Banach algebra with the unit element; then 
(a) any element a € A fulfilling |ja—e|| < 1 is invertible. The set R of all invertible 
elements in A is open and the map a+ a~! is continuous in it, 
(b) the resolvent set p,(a) of any element a € A is open in @ and the resolvent 
Ta: PA A is analytic, 
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(c) the spectrum o4(a) of any element a € A is a nonempty compact set, 
(d) the spectral radius r(a) := sup{|A|: A € o4(a)} is independent of A and equals 


= if nyl/n — ; nyjl/n, 
r(a) = lim |la”|| inf la" ||" ; 
it does not exceed the norm, r(a) < |lall . 


The independence feature of part (d) is not apparent in the case of bounded operators, 
where the spectral quantities are related to a single algebra B(H). To appreciate this 
result, notice that the spectrum, and in particular its radius, is a purely algebraic property, 
while the right side of the formula depends on the metric properties of the algebra A. 


B.2.5 Proposition: Let A, B be Banach algebras; then 

(a) a morphism y: A — B is continuous iff there is C such that ||y(a)lls < Cllall4 
for any a € A; if y is a continuous isomorphism, then the algebras A, B are 
topologically isomorphic, 

(b) if A, B are complete envelopes of normed algebras Ap, By, then any continuous 
morphism Yo : Ag — Bo has just one continuous extension y: A— B, 

(c) if 7 is the null-space of a continuous surjective morphism y : A — B, then A/7 
and B are topologically isomorphic. 


An isomorphism y: A = B is called isometric if ||p(a)||g = |lal|4 holds for all ac A. 


B.2.6 Theorem (Gel’fand—Mazur): A Banach algebra with the unit element, in which 
any nonzero element is invertible, is isometrically isomorphic to the field @ of complex 
numbers. 


B.3 Lie algebras and Lie groups 


A Lie algebra (real, complex, or more generally, over a field F’) is a finite-dimensional 
(nonassociative) linear algebra L with the multiplication conventionally denoted as (a,b) > 
{a, 6], which is antisymmetric, [a,b] = —[b, a], and satisfies the Jacobi identity (a, [b, cl] + 
(b, {c, al] + [c, [a, b]] = 0 for any a,b,c € L. The dimension of L is its vector-space 
dimension; if {e;}7_, is a basis in L, the product is fully determined by the relations 
[exnex) = Reis where the coefficients ci, are called the structure constants (one uses the 
summation convention, according to which the sum is taken over any repeated index). A 
complex extension Le is a complex extension of L as a vector space with the product 
(a1 + iby, @2+ iba] := [a1, a2] — [b1, be] + t[a1, ba] + [b1, a9] . 

A Lie algebra is commutative if [a,b] = 0 for all a,b € L, i.e, all the structure 
constants are zero; this definition differs from the associative case. In contrast, other defi- 
nitions like those of a subalgebra, ideal, and also morphisms, representations, etc., modify 
easily for Lie algebras. 


B.3.1 Examples: Consider the following matrix algebras: 

(a) The set of all n xn real matrices forms an n?-dimensional real Lie algebra called the 
general linear algebra and denoted as gl(n, IR) ; its subalgebra sl(n, JR) consists of 
all traceless g € gl(n, IR). Its complex extension is the algebra gl(n,€) of nxn 
complex matrices; similarly traceless complex matrices form the algebra sl(n,C) , 
which is often also denoted as An-1. 


550 APPENDIX B. SOME ALGEBRAIC NOTIONS 


(b) The algebra gl(m,@) has other Lie subalgebras. A matrix g is said to be skew- 
symmetric if g' = —g, where g* denotes the transposed matrix of g (this property 
should not be confused with antihermiticity). The set of skew-symmetric m x 
m matrices forms the orthogonal Lie algebra, which is denoted as o(m,@) ; we 
alternatively speak about the algebras B, and Dp for o(2n+1,@) and o(2n,C), 
respectively. On the other hand, consider the subset in gl(2n,€') consisting of 
matrices such that g’jon + jong =0, where jon =( 4 e ) and e, isthenxn 
unit matrix. The corresponding Lie algebra is called symplectic and denoted as 


sp(n,C) or Ch. 


Given a Lie algebra L, we define the subalgebras L'") by the recursive relations 
L© := L and L(+) := [L™, L) ; similarly we define Lin) by Lio) := L and Lin4i) := 
[Lin), L] . The algebra is solvable if L(") = {0} for some n ; it is nilpotent if Lin) for some 
n. Any nilpotent algebra is solvable. A commutative Lie algebra is, of course, nilpotent; 
a less trivial example is the Heisenberg—Wey] algebra, which is nilpotent of order two. 

On the other hand, a Lie algebra L is semisimple if it has no commutative ideal; it 
is simple if it has no (nontrivial) ideal at all. An equivalent characterization leans on the 
notion of the Cartan tensor g: grs = ci,ck;, through which one defines the Killing form 
LxL— F by (a,b) := g,ysa"b’. The algebra L is semisimple iff its Killing form is 
nondegenerate, i.e., detg #0. For real Lie algebras, the Killing form may be used to 
introduce the following notion: JL is compact if the form is positive, and noncompact 
otherwise. A compact DL can be expressed as L= Z@S=Z@S,@---@Sn, where Z 
is its center, S is semisimple, and S,...,5, are simple algebras. 

Simple Lie algebras allow a full classification. It appears that, up to an isomorphism, 
complex simple algebras are almost exhausted by the types An, Bn,Cn, and Dy, listed 
in Example 1; there are just five more simple Lie algebras called exceptional. For small 
values of nm, some of these algebras are isomorphic, namely A; ~ By ~ C,, Bo ~ Do, 
and A3 ~ D3, while D2 is semisimple and isomorphic to A; @ A; . One can classify real 
forms of simple complex algebras in a similar way. 

The notion of a group was introduced above; for simplicity we shall here denote the 
group operation as a multiplication. A subgroup of a group G is a subset H C G which 
is itself a group w.r.t. the same operation. H is a left invariant subgroup if hg € H holds 
for any h € H and g © G. We define the right invariant subgroup in a similar way; a 
subgroup is invariant provided it is left and right invariant at the same time (these notions 
play a role analogous to ideals in algebras). The notions of the direct product of groups 
and a factor group are easy modifications of the above discussed algebraic definitions. 

A topological group is a group G, which is simultaneously a T, topological space such 
that the map g++ g~! is continuous and the group multiplication is jointly continuous. An 
isomorphism of topological groups is a map which is a group isomorphism and, at the same 
time, a homeomorphism of the corresponding topological spaces. One introduces various 
classes of topological groups according to the properties of G as a topological space, e.g., 
compact groups or locally compact groups. In a similar way, one defines a connected group. 
If G is not connected, it can be decomposed into connected components; the component 
containing the unit element is a closed invariant subgroup. 


B.3.2 Examples: (a) The group 7, of translations of the Euclidean space JR” is a 
commutative topological group, which is locally but not globally compact. 
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(b) The orthogonal group O(n) consists of real orthogonal n x n matrices, i.e., such 
that gg = e. It is compact and has two connected components specified by the 
conditions det g = +1 ; the connected component of the unit element is the rotation 
group denoted as SO(n). 

(c) The group U(n) of unitary complex nxn matrices is locally compact and connected; 
the same is true for its subgroup SU(n) of matrices with detg =1. 

(d) The substitution operators U, of Example 3.3.2 form a group. If we equip the set 
of mappings y with the metric 0(~,¢) := suPzeRn |p(r) —G(z)|, it becomes a 
topological group which is not locally compact. 


A topological group can also be equipped with a measure. The easiest way to introduce 
it is through linear functionals — cf. the concluding remark in Section A.2. Consider such 
‘a functional 4 on the space Co(G) of continuous functions with compact supports which 
is positive, i.e., u(f) > 0 holds for all f > 0. If it satisfies u(f(g~!-)) = u(f) for any 
f € Co(G) and g €G,, it is called a left Haar measure on G. On a locally compact group 
G, there is always a left Haar measure and it is unique up to a multiplicative constant. 
We introduce the right Haar measure in a similar way; a measure on G is said to be 
invariant if it combines the two properties. 

An important class of topological groups consists of those which allow a locally Eu- 
clidean parametrization. To be more precise, the notion of an analytic manifold is needed. 
This is a Hausdorff space M together with a family of pairs (Ua, ~a), aE 1, where Ua 
is an open set in M and Yq is a homeomorphism U, — JR” for a fixed n with the 
following properties: U,¢;Ua = M and for any a, f € I the component functions of the 
map ygoy,) are real analytic on Ya(Ua Ug). The number n is called the dimension 
of the manifold; replacing IR” by @™” in the definition we introduce complex analytic 
manifolds in the same way. 

Given analytic manifolds M, N, one can associate with their topological product 
(M x N,tuxn) the family of pairs (Ua x Vs, Ya X Wg). The obtained structure again 
satisfies the above conditions; we call it the product manifoldof M and N. The dimension 
of the product manifold is m=my+my. A map M — N is analytic if the component 
functions of all the maps ygopz! are analytic on their domains. 

A group G is called a (real or complex) Lie group if it is an analytic manifold (real 
or complex, respectively) and its multiplication and inversion as maps G x G — G and 
G — G, respectively, are analytic. For instance, the groups of Examples 2a-c belong to 
this class; this is not true for the group of Example 2d, where the dimension of the group 
manifold is infinite. Any Lie group is locally compact. A subgroup of G which is itself a 
Lie group with the same multiplication is called a Lie subgroup. 

As an analytic manifold, at the vicinity of any point a Lie group admits a description 
through local coordinates defined by the corresponding map %,. This concerns, in parti- 
cular, the unit element e: there is a neighborhood U of the point 0 € JR” where we can 
parametrize the group elements by g = (91,---,9n). The composition law of G is then 
locally expressed by real analytic functions f;, 7 =1,...,n, from U x U to JR” so that 
(gh); = = f(91,-- +) 9n,hi,.. +s An) ; the consistency requires f;(9,0) = = hh F£5(0, h) = = hj 
and (Of; /29%)¢o, o) = (9f; /Ohe) (0,0) = 5jk. The structure constants of the group G' are 
defined by ci, := (0? f*/09;0h; — & f*/Oh;Og9x) (0,0) ; they satisfy the same conditions as 
the structure constants of Lie algebras. 

This is not a coincidence; there is a close connection between Lie groups and Lie 
algebras. Let U be the neighborhood of the parameter-space origin used above and 
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consider the space C®(U). The operators T; : (Tj¢)(g) = (06/99;)(99;*)| g=o are then 
well-defined and span a Lie algebra L which is said to be associated with G. The 
correspondence extends to subalgebras: if H is a Lie subgroup of G, then its Lie algebra 
M is asubalgebra of L. Moreover, if H is an invariant subgroup, then M is an ideal in 
L, etc. On the other hand, the association G++ L is not injective. 


B.3.3 Examples: (a) The group of rotations (translations on a circle) SO(2) can be 
expressed as SO(2) = 7;/Z , where T; means translations on a line and Z is the 
additive group of integers. Both SO(2) and T; have the same (one-dimensional) 
Lie algebra so(2). 

(b) Let Z2 be the two-point group {0,1} with the addition modulo 2. The Lie groups 
SU(2) and SO(3) = SU(2)/Z2 have the same Lie algebra so(3) as was discussed 
in Example 10.2.3e. 


Basic notions concerning the representation theory of Lie groups and algebras can be 
readily adapted from the preceding sections. The correspondence discussed above induces 
a natural relation between some representations of a Lie group and those of its Lie algebra; 
in the simplest case of a one-dimensional G this is the content of Stone’s theorem. In 
general, however, the representation theory of Lie groups and algebras is a complicated 
subject which goes beyond the scope of the present book; we refer the reader to the 
literature quoted at the beginning of the appendix. 
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